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DESCRIPTION 



PLASMID 



TECHNICAL FIELD 

The present invention relates to a plasmid vector 
usable in genetic recombinant technology and to a method 
for expressing a gene by using the plasmid vector. The 
present invention also relates to a method for isolating a 
desired gene by using such a plasmid vector. In addition, 
the present invention relates to a restriction enzyme and 
a gene thereof available as a genetic engineering reagent, 
in more detail to the AccIII restriction endonuclease and 
a DNA coding therefor. 
BACKGROUND ART 

In constructing an expression system for a desired 
gene by genetic recombinant technology, expression of the 
gene is controlled by bringing it under control of a 
promoter recognized by the RNA polymerase of the host 
used. In the case of a gene encoding a protein harmful to 
the host, however, plasmid construction itself is 
sometimes hampered by expression of the product of the 
gene due to the inability to stringently control the 
expression of the promoter used. 

As an expression system resolving that problem, the 
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pET system (produced by Novagen) has been developed, which 
uses the RNA polymerase of the bacteriophage T7, which 
infects Escherichia coll, with Escherichia coll as a host 
[journal of Molecular Biology, Vol. 189, pp. 113-130 
(1986); Gene, Vol. 56, pp. 125-135 (1987)]. The P ET 
system is a system that allows T7 RNA polymerase, which 
has high promoter recognition specificity and high 
transcription activity, to be expressed in Escherichia 
coll, which T7 RNA polymerase transcribes a desired gene 
placed downstream of the T7 promoter on an expression 
vector and causes high expression of the gene. Because 
transcription of the desired gene occurs in the presence 
of T7 RNA polymerase, plasmid construction in the host is 
possible without expressing the desired gene, provided 
that the host does not produce the polymerase; plasmid 
construction itself is never hampered, as in cases where 
the expression system is constructed, while the desired 
gene is kept under control of a promoter recognized by the 
RNA polymerase of the host. 

However, because the T7 RNA polymerase gene has been 
cloned onto the X-phage vector and lysogenized into the 
expression host, there is no freedom of host choice; 
painstaking procedures are needed if the host is changed, 
in addition, because the expression of T7 RNA polymerase 
in the host is not stringently controlled, T7 RNA 
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polymerase is expressed even when the host is in a 
non-inductive condition, resulting in expression of the 
desired gene placed downstream of the T7 promoter on the 
expression vector even in a non-inductive condition. To 
suppress such expression of the desired gene in a 
non-inductive condition, T7 RNA polymerase activity is 
inhibited using T7 lysozyme, a T7 RNA polymerase inhibitor 
[Journal of Molecular Biology, Vol. 219, pp. 37-44 
(1991)], or T7 RNA polymerase is prevented from getting 
access to the T7 promoter by placing a lactose operator 
downstream of the T7 promoter [Journal of Molecular 
Biology, Vol. 219, pp. 45-59 (1991)]. 

However, even these countermeasures are 
unsatisfactory in terms of effect against T7 RNA 
15 polymerase of high transcription activity so that the 

activity of T7 RNA polymerase in a non-inductive condition 
cannot be completely inhibited. For this reason, if the 
desired gene product is lethal to the host, it is 
impossible in some cases to prepare a transformant for 
expression of the gene, even when plasmid construction is 
possible. In other words, the pET system involves two 
problems to be resolved: one of the inability to freely 
change the host, and the other of inaccurate control of T7 
RNA polymerase expression. 

On the other hand, there is a bacteriophage having 
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characteristics similar to those of the bacteriophage T7, 
known as the bacteriophage SP6 [Science, Vol. 133, pp. 
2069-2070 (1961)], which infects Salmonella typhimurium. 
The RNA polymerase produced by the bacteriophage SP6, a 
single peptide having a molecular weight of about 100,000, 
is commonly used for in vitro RNA synthesis since it 
possesses high promoter recognition specificity and high 
transcription activity [Journal of Biological Chemistry, 
Vol. 257, pp. 5772-5778 (1982); Journal of Biological 
Chemistry, Vol. 257, pp. 5779-5788 (1982)]. In addition, 
the SP6 RNA polymerase gene has already been cloned and 
expressed in large amounts in Escherichia coll [Nucleic 
Acids Research, Vol. 15, pp. 2653-2664 (1987)]. 

Genes whose expression product acts lethally on hosts 
are exemplified by restriction endonuclease genes. 
Essentially, restriction endonucleases are utilized for 
self-defence by cleaving phages and other exogenous DNA 
entering the cells of microorganisms that produce the 
restriction endonucleases. On the other hand, 
microorganisms that produce restriction endonucleases 
mostly produce modification enzymes that recognize the 
same base sequences as those of the restriction 
endonucleases, to protect their own DNA against cleavage 
by the restriction endonucleases. Specifically, a 
modification enzyme modifies DNA by adding a methyl group 
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to one or more bases in the base sequence recognized 
thereby, to make it impossible for the restriction 
endonuclease that recognizes the same sequence as that of 
the modification enzyme to bind thereto or to cleave the 
5 DNA. This mechanism is called restriction modification 
system, and the pair of genes of the restriction 
endonuclease and modification enzyme that constitute the 
restriction modification system called restriction 
modification system gene. Therefore, when the restriction 
10 endonuclease gene is expressed in a microorganism lacking 
a modification enzyme gene from the restriction 
modification system gene, the microorganism's DNA is 
cleaved, resulting in cell death. In fact, there are two 
modification enzyme genes in the Mbol restriction 
15 modification system gene; it has been reported that 

cloning of restriction endonuclease genes is impossible 
due to incomplete modification of the host DNA in the case 
of incomplete methylation in the co-presence of either 
modification enzyme gene alone [Nucleic Acids Research, 
20 Vol. 21, pp. 2309-2313 (1993)]. 

Also, it has been demonstrated that if a restriction 
modification system gene is lost from a cell retaining the 
restriction modification system gene, a lack of 
modification activity in the cell results in incomplete 
25 methylation of genomic DNA, which in turn causes lethal 



cleavage of its own genomic DNA by a very small amount of 
restriction endonuclease remaining therein [Science, 
Vol. 267, pp. 897-899 (1995)]. In summary, in the absence 
of modification enzymes that constitute a restriction 
5 modification system, restriction endonucleases behave as 
proteins very harmful to cells; separate cloning and 
expression of their genes have been impossible by prior 
art technologies. 

Concerning restriction endonucleases, restriction 
10 endonucleases can be classified by their enzymatic 

properties into three types: I, II and III. Type II 
restriction endonucleases, in particular, each of which 
recognizes a particular DNA base sequence and cleaves it 
at a particular site in or near the sequence, are 
15 extensively used in the field of genetic engineering, and 
restriction endonucleases of this type with various 
specificities have been isolated from a variety of 
microorganisms [Nucleic Acids Research, Vol. 24, pp. 
223-235 (1996)]. In the present specification, a type II 
20 restriction endonuclease is hereinafter referred to as 

"restriction endonuclease". It should be noted, however, 
that some microorganisms produce only small amounts of 
restriction endonuclease, and others produce a plurality 
of restriction endonucleases. For example, the 
25 restriction endonuclease AccIII is produced by 
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Aclnetobacter calcoacetlcus (hereinafter referred to as 
Acc bacterium), which has been deposited under accession 
number PERM BP-935 at the National Institute of Bioscience 
and Human-Technology, Agency of Industrial Science and 
Technology, Ministry of International Trade and Industry 
[address: 1-3, Higashi 1-chome, Yatabemachi, Tsukuba-gun, 
Ibaraki, 305, Japan] since November 9, 1985 (date of 
original deposition), but the amount of the enzyme 
produced is small and this microorganism also produces the 
restriction endonucleases AccI and AccII simultaneously. 
Therefore, advanced production technology is needed to 
provide the restriction endonuclease AccIII as a reagent 
of high purity and low cost using this microorganism. In 
providing a restriction endonuclease as a reagent of high 
15 purity and low cost, it is effective to isolate the 

desired restriction endonuclease gene and selectively 
produce the desired restriction endonuclease in large 
amounts by genetic engineering technology. To accomplish 
this purpose, some methods of isolating restriction 
endonuclease genes have been reported. 

First, there may be mentioned the "shotgun" method, 
wherein the genomic DNA of a microorganism that produces a 
restriction endonuclease is cleaved using the appropriate 
restriction endonuclease, the resulting fragment is 
inserted into an appropriate plasmid vector, and a clone 
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expressing the restriction endonuclease gene is selected. 
Screening methods for desired clones are exemplified by a 
method wherein a restriction modification system gene is 
isolated with resistance to phage infection as an index, 
on the basis of the self-defence function acquired by the 
host upon introduction of the restriction modification 
system gene thereinto [PstI: Proceedings of the National 
Academy of Science of the USA, Vol. 78, pp. 1503-1507 
(1981)]. This method, however, necessitates that the size 
of the restriction modification system gene falls within a 
range allowing its isolation, and that the expression of 
the restriction modification system gene isolated exhibits 
sufficient phage resistance to allow the selective 
survival of the host. On the other hand, as a general 
feature of restriction modification system genes, there 
may be mentioned the close location on the genome of 
restriction endonuclease genes and modification enzyme 
genes; in fact, this has been confirmed in many 
restriction modification system genes that have so far 
been obtained [Nucleic Acids Research, Vol. 19, pp. 
2539-2566 (1991)]. Accordingly, there is a method wherein 
a restriction modification system gene is screened for 
with the expression of a modification enzyme gene as an 
index on the basis of the above-described feature 
[Japanese Patent Laid-Open No. 63-87982; Nucleic Acids 



Research, Vol. 19, pp. 1831-1835 (1991)]. When the 
restriction endonuclease gene is not close to the 
modification enzyme gene, however, this method fails to 
yield the restriction endonuclease gene. 

Furthermore, the above-described "shotgun" method 
poses a fundamental problem associated with a difference 
in transcription-translation mechanism between the genomic 
DNA source organism and the host. For example, in the 
case of insufficient gene expression due to the failure of 
the promoter and ribosome binding site accompanying the 
restriction modification system gene to function well in 
the host, much labor is needed to select transf ormants 
containing the desired gene, even if obtained. To avoid 
this drawback, there is a method wherein the amino acid 
sequence of the restriction endonuclease protein is 
analyzed, the restriction endonuclease gene is obtained 
from the genomic DNA of a microorganism that produces the 
restriction endonuclease by PCR-based DNA amplification on 
the basis of the sequence data obtained, and wherein a 
known protein expression system is utilized [Japanese 
Patent Laid-Open No. 6-277070]. Because the presence of a 
restriction endonuclease is lethal to the host in 
conventional protein expression systems, there is a need 
to protect the host by, for example, allowing a 
modification enzyme that constitutes a restriction 
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modification system together with the enzyme to be 
co-present . 

Although all the above-described methods of the 
isolation of restriction endonuclease genes necessitate 
the simultaneous isolation of the restriction endonuclease 
gene and a modification enzyme gene that constitutes a 
restriction modification system gene together with the 
gene, another method of isolating the restriction 
endonuclease gene alone has been reported [Nucleic Acids 
Research, Vol. 22, pp. 2399-2403 (1994)]. In that method, 
however, it is intended to isolate a gene encoding a 
restriction endonuclease for which optimal temperature for 
enzyme activity is around 70 'C; the co-presence of a 
modification enzyme gene is necessary when the gene to be 
15 isolated encodes a restriction endonuclease showing high 
specific activity near host culturing temperature. 

Exceptionally, there are restriction endonucleases 
that do not show cleavage activity unless a particular 
nucleic acid base in their DNA recognition sequence has 
not been modified by methylation, like the restriction 
endonuclease Dpnl . Genes for restriction endonucleases 
possessing this property are thought to be exceptional in 
that they can be isolated even in the absence of another 
particular gene by selecting the appropriate host 
25 organism. In fact, the mrr gene has been isolated, which 



20 



encodes the Mrr protein, which is not a type II 
restriction endonuclease but which recognizes a particular 
DNA base sequence containing a methylated nucleic acid 
base and exhibits DNA cleavage activity [Journal of 
Bacteriology, Vol. 173, pp. 5207-5219 (1991)]. 

as stated above, isolation of a restriction 
endonuclease gene by the prior art necessitates the 
simultaneous expression of the gene and a modification 
enzyme gene that constitutes a restriction modification 
system gene together with the gene, except for special 
cases . 
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disclosure_ofl™e_^^^ 

Therefore, a first object of the present invention 
provides a plasmid vector capable of isolating such a gene 
that an isolation thereof or a construction of an 
expression system thereof has been difficult in the prior 
arts because the gene product is lethal or harmful to a 
host, and capable of introducing into a host to express 
the protein efficiently. A second object of the present 
invention provides a method for expressing a desired gene 
by using this plasmid vector. A third object of the 
present invention provides a method for isolating a 
desired gene by using this plasmid vector, especially for 
isolating a restriction endonuclease gene without co- 



- 12 - 



existence of a modification enzyme gene constituting a 
restriction modification system. A fourth object of the 
present invention provides a polypeptide possessing an 
activity of an AccIII restriction endonuclease . In 
5 addition, a fifth object of the present invention provides 
a DNA which encodes a polypeptide possessing an activity 
of an AccIII restriction endonuclease. 

First., to resolve the problems in the pET system, the 
present inventors have constructed an expression system 
10 using SP6 RNA polymerase as a new accurate expression 
control expression system, and assessed the system. 

Because the expression system is constructed using a 
system plasmid inserted the SP6 RNA polymerase gene into 
the miniF plasmid, expression systems for the desired gene 
15 can be constructed in various strains of Escherichia coll 
by simultaneously introducing an expression vector 
harboring the desired gene cloned downstream of the SP6 
promoter and this system plasmid into the host. In 
addition, using the lac promoter and antisense technology, 
the present inventors made it possible to exactly control 
the expression of the SP6 RNA polymerase gene on the 
system plasmid. Assessing this expression system using 
the p-galactosidase gene as a reporter gene demonstrated 
that the expression of the SP6 RNA polymerase gene is 
25 accurately controlled in this system, that there is almost 
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no expression of SP6 RNA polymerase in a non-inductive 
condition, and that induction is followed by the 
expression of a sufficient amount of SP6 RNA polymerase to 
efficiently express the desired gene and subsequent 
5 expression of the desired gene at a high level. 

It was also shown, however, that when the host used 
is Escherichia coll, the desired gene downstream of the 
SP6 promoter is expressed in very small amounts even in a 
non-inductive condition because the SP6 promoter is very 
10 weakly but actually recognized by Escherichia coll RNA 

polymerase. It was thus proven that when the gene product 
acts very harmfully and lethally to Escherichia coll, 
expression system construction is impossible so that the 
object of the present invention cannot be accomplished 
15 well. 

With these findings in mind, the present inventors 
have made further extensive investigation and unexpectedly 
found that (i) expression of the desired gene in a 
non-inductive condition can be suppressed to undetectable 

20 levels, and that (ii) expression induction increases the 

copy number of the plasmid containing the desired gene and 
causes RNA polymerase expression, resulting in the 
transcription and translation of the desired gene placed 
downstream of a promoter recognized by the RNA polymerase, 

25 by using a new system for controlling the expression of 
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the desired gene by means of a combination of two control 
methods, i.e., control of the copy number of the gene, and 
transcription control via the promoter. In other words, 
the plasmid vector of the present invention is a plasmid 
vector having unique features to resolve the above 
problems in the field of genetic engineering. The 
present inventors made further investigation based on this 
finding, and developed a method for expressing the desired 
gene using the vector. 

The present inventors also have developed a method 
for isolating a gene whose expression product acts 
lethally on the host, using the above-described plasmid 
vector, more specifically a method of isolating a 
restriction endonuclease gene without the drawbacks of the 
prior art. 

The present inventors also found it possible to 
isolate DNA encoding the AccIII restriction endonuclease, 
which had not been obtained so far, without the 
co-presence of an AccIII modification enzyme, and express 
the AccIII restriction endonuclease in large amounts, 
using the above-described method. 

(!) The gist of the present invention is concerned with: 
A plasmid vector characterized by comprising a promoter 
sequence to control an expression of a desired gene, the 
promoter sequence being recognized by an RNA polymerase 
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not inherent to a host, and a replication origin for 
increasing a copy number by induction with an exogenous 
factor; 

(2) The plasmid vector described in item (1) above, 
5 wherein the promoter sequence is recognized by RNA 

polymerases derived from bacteriophages; 

(3) The plasmid vector described in item (2) above, 
wherein the promoter sequence is recognized by an RNA 
polymerase derived from SP6 phage; 

10 (4) The plasmid vector described in item (3) above, 

wherein the promoter sequence contains the base sequence 
of SEQ ID NO: 30 in the Sequence Listing; 

(5) The plasmid vector described in any one of items (1) 
to (4) above, wherein the replication origin is under 

15 control of a promoter; 

(6) The plasmid vector described in any one of items (1) 
to (5) above, wherein the replication origin is under 
control of the lac promoter; 

(7) The plasmid vector described in any one of items (1) 
20 to (6) above, comprising a drug resistance gene as a 

selection marker; 

(8) The plasmid vector described in item (7) above, which 
is selected from pACE601, P ACE611, pACE701 and pACE702; 

(9) A plasmid vector in which a desired gene to be 
25 expressed is incorporated into the plasmid vector 
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described in any one of items (1) to (8) above; 

(10) A method for expressing a desired gene, characterized 
by introducing into a host a plasmid vector in which the 
desired gene is incorporated into the plasmid vector 
described in any one of items ( 1 ) to ( 8 ) above, and an RNA 
polymerase gene which recognizes a promoter sequence in 
the plasmid vector, and inducing an increase in a copy 
number of the plasmid vector and an expression of the RNA 
polymerase by using an exogenous factor to transcribe and 
translate the desired gene; 

(11) The method for expressing a desired gene described in 
item (10) above, characterized in that the increase in the 
copy number of the plasmid vector and the expression of 
the RNA polymerase are induced by respective exogenous 
factors; 

(12) The method for expressing a desired gene described in 
item (10) above, characterized in that the increase in the 
copy number of the plasmid vector and the expression of 
the RNA polymerase are induced by a same exogenous factor; 

(13) The method for expressing a desired gene described in 
any one of items (10) to (12) above, wherein the exogenous 
factor which induces the increase in the copy number of 
the plasmid vector, is one or more selected from the group 
consisting of an addition of isopropyl-p-D-thiogalactoside 
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(IPTG), an addition of lactose, an addition of galactose, 
an addition of arabinose, a reduction of a tryptophane 
concentration and an adjustment of a transformant 
cultivation temperature; 

(14) The method for expressing the desired gene described 
in any one of items (10) to (12) above, wherein the 
exogenous factor which induces the expression of the RNA 
polymerase, is one or more selected from the group 
consisting of an addition of isopropyl- 0-D-thiogalactoside 
(IPTG), an addition of lactose, an addition of galactose, 
an addition of arabinose, a reduction of a tryptophane 
concentration and an adjustment of a transformant 
cultivation temperature; 

(15) The method for expressing a desired gene described in 
item (10) above, characterized in that the RNA polymerase 
gene is introduced into the host by the other plasmid 
vector or a phage vector; 

(16) The method for expressing a desired gene described in 
item (10) above, characterized in that the RNA polymerase 
gene is incorporated into a chromosome of the host; 

(17) The method for expressing a desired gene described in 
item (15) or item (16) above, characterized in that the 
RNA polymerase gene is derived from SP6 phage; 

(18) The method for expressing a desired gene described in 
any one of items (10) to (17) above, wherein the desired 
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gene encodes a protein lethal or harmful to the host; 

(19) The method for expressing a desired gene described in 
any one of items (10) to (18) above, characterized in that 
Escherichia coll is used as the host; 

(20) A method for isolating a desired gene, characterized 
in that the plasmid vector described in any one of items 
(1) to (8) above is employed in the method for isolating 
the desired gene; 

(21) The method for isolating a desired gene described in 
item (20) above, wherein the desired gene encodes a 
protein lethal or harmful to a host; 

(22) The method for isolating a desired gene described in 
item (21) above, wherein the gene encoding a protein 
lethal or harmful to the host is a restriction 

15 endonuclease gene; 

(23) A polypeptide containing the entire or a portion of 
the amino acid sequence shown by SEQ ID NO : 1 in the 
Sequence Listing, and possessing an activity of AccIII 
restriction endonuclease; 

(24) A polypeptide having an amino acid sequence resulting 
from at least one of deletion, addition, insertion or 
substitution of one or more amino acid residues in the 
amino acid sequence of SEQ ID N0:1 in the Sequence Listing 
or a portion thereof, and possessing an activity of AccIII 

25 restriction endonuclease; 
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(25) A DNA encoding a polypeptide which contains the 
entire or a portion of the amino acid sequence shown by 
SEQ ID N0:1 in the Sequence Listing, and possesses an 
activity of AccIII restriction endonuclease; 
5 (26) A DNA containing the entire or a portion of the DNA 
shown by SEQ ID NO: 2 in the Sequence Listing wherein an 
expression product of the DNA possesses an activity of 
AccIII restriction endonuclease; 

(27) A DNA encoding a polypeptide resulting from at least 
10 one of deletion, addition, insertion or substitution of 

one or more amino acid residues in the amino acid sequence 
of SEQ ID N0:1 in the Sequence Listing or a portion 
thereof, and possessing an activity of AccIII restriction 
endonuclease; and 
15 (28) A DNA capable of hybridizing to the DNA described in 
any one of items (25) to (27) above, and encoding a 
polypeptide possessing an activity of AccIII restriction 
endonuclease . 

20 BRIEF DESCRIPTION OF THE DR AWINGS 

Figure 1 shows the constitution of the model 
^(XL> expression plasmids pMSP6L and pMSP6F. 

Figure 2 shows the constitution of the model 
jJ^flS^ expression plasmid pMSP60. 

25 Figures 3 (A), (B) show the procedure of the 
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construction of the runaway plasmid P HS2870. 

Figu re 4 shows the constitution of the runaway 

plasmid pHS2870. 

Figures 5 (A), (B) show the procedure of the 
construction of the plasmid P CRS04. 

Figure 6 shows the constitution of the plasmid 

pCRS04. 

Figure 7 shows the constitution of the plasmid 
PCRA19. 

Figures 8 (A), <B>, <C. CD). < = > 
of the construction of the system plasmid P FSP6. 

Figures 9 (A), (B) show the procedure of the 
construction of the plasmid pACE601. 

Figure 10 shows the constitution of the plasmid 

pACE611. 

Figures 11 (A), <B) show the procedure of the 
construction of the plasmid pACE611. 

Figures 12 (A), (B) show the procedure of the 
construction of the plasmid pCRS70. 

Figures 13 (A), (B) show the procedure of the 
construction of the plasmids pACE70! and PACE702. 

Figure 14 shows the expression of the activity of Nsp 
7524 III restriction endonuclease depending on the 
induction or non-induction of IPTG. by agarose gel 
electrophoresis of the degradation reaction solution of X- 
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DNA after introducing pFSP6 and pACE601 or pFSP6 and 
P ACE601-NspIII into Escherichia coli HB101. In this 
figure, M indicates the X-EcoT141 size marker; 1 indicates 
Aval digested X-DNA; 2 indicates HB101/pFSP6/pACE601 after 
induction; 3 indicates HBl01/pFSP6/pACE601 before 
induction; 4 indicates HBl01/pFSP6/pACE601-NspIII after 
induction; and 5 indicates HBl01/pFSP6/pACE601-NspIII 
before induction. 

Figure 15 shows a result of an agarose gel 
electrophoresis in the case of 2 and 3 hour-elongation of 
the degradation reaction of X-DNA depending on the 
induction or non- induction of IPTG after introducing P FSP6 
and P ACE601 or pFSP6 and P ACE601-NspIII into Escherichia 
coli HB101. in this figure, M indicates the X-EcoTl41 
15 size marker; 1 indicates Aval digested X-DNA; 2 indicates 
HB101/ P FSP6/ P ACE601 after 1 hour (induction); 3 indicates 
HB101/pFSP6/pACE601 after 2 hours (induction); 4 indicates 
HB101/ P FSP6/pACE601 after 3 hours (induction); 5 indicates 
HB101/pFSP6/pACE601 after 1 hour (non-induction); 6 
indicates HBl01/pFSP6/pACE601 after 2 hours (non- 
induction); 7 indicates HBl01/ P FSP6/pACE601 after 3 hours 
(non-induction); 8 indicates HB101/ P FSP6/pACE601-NspIII 
after 1 hour (induction); 9 indicates HB101/pFSP6/pACE601- 
NspIII after 2 hours (induction); 10 indicates 
25 HB101/pFSP6/pACE601-NspIII after 3 hours (induction); 11 
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indicates HBl01/pFSP6/pACE601-NspIII after 1 hour (non- 
induction); 12 indicates HB101/pFSP6/pACE601-NspIII after 
2 hours (non- induction); and 13 indicates 

HB101/pFSP6/pACE601-NspHI after 3 hours (non-induction). 

Figure 16 shows the construction of the AccIII 
restriction modification system gene. 

BEST MODE FOR CARRY I KTr: ™TT THE INVENTION 

The first aspect of the present invention relates to 
a plasmid vector. More specifically, it relates to a 
plasmid vector characterized in that the plasmid vector 
comprises a promoter sequence, which is recognized by an 
RNA polymerase not inherent to the host, to control the 
expression of the desired gene, and a replication origin 
for increasing the copy number by induction with an 
exogenous factor. The promoter sequence contained in the 
plasmid vector, which sequence controls the expression of 
the desired gene, may be any promoter sequence; promoter 
sequences recognized by particular RNA polymerases, e.g., 
those recognized by RNA polymerases derived from the T7 , 
T3, SP6 and other bacteriophages, can be used, with 
preference given to those recognized by the RNA polymerase 
derived from the SP6 phage. The base sequence of the 
minimum region of the promoter recognized by the RNA 
polymerase derived from the SP6 phage, i.e., the SP6 
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promoter, is shown by SEQ ID NO: 30 in the Sequence 
Listing. Furthermore, the plasmid vector may have a drug 
resistance gene used as a selection marker. 

Exogenous factors that control the copy number of the 
plasmid vector of the present invention, i.e., the 
function of the replication origin of the plasmid vector, 
include, but are not limited to, for example, the addition 
of various chemicals, such as lactose or structurally 
related substances like isopropyl-p-D-thiogalactoside 
(IPTG), galactose or structurally related substances, and 
arabinose or structurally related substances; reduction of 
tryptophane concentration; and adjustment of transformant 
cultivation temperature. By bringing the replication 
origin under control of a promoter that can be controlled 
by these factors, copy number control becomes possible. 
Promoters usable for this purpose include, but are not 
limited to, the lac, trp, tac, gal, are and P L promoters 
etc. when the host used is Escherichia coll, as long as 
the above-described purpose is accomplished. These 
promoters can also be used to induce the expression of an 
RNA polymerase gene. 

Examples of the replication origin of the plasmid 
vector of the present invention include, but are not 
limited to, plasmid vectors that serve as replication 
origins of runaway plasmids under control of the lac 



promoter. In this case, copy number control is achieved 
by the addition of lactose and structurally related 
substances, most preferably by the addition of 
isopropyl-p-D-thiogalactoside ( IPTG) - 

The second aspect of the present invention relates to 
a method for expressing a desired gene to be expressed, 
characterized by introducing a plasmid vector prepared by 
incorporating the desired gene into the plasmid vector of 
the present invention, and an RNA polymerase gene that 
recognizes the promoter sequence on the plasmid vector, 
into a host, and inducing an increase in the copy number 
of the plasmid vector and the expression of the RNA 
polymerase with an exogenous factor to transcribe and 
translate the desired gene. RNA polymerase genes that 
recognize the promoter sequence on the plasmid vector 
include, but are not limited to, for example, the RNA 
polymerase genes derived from the above-mentioned 
bacteriophages, with preference given to the RNA 
polymerase gene derived from the SP6 phage. Exogenous 
factors that induce the expression of such RNA polymerase 
or an increase in the copy number of the plasmid vector 
include, but are not limited to, for example, the addition 
of various chemicals, such as lactose or structurally 
related substances like isopropyl-p-D-thiogalactoside 
(IPTG), galactose or structurally related substances, and 



arabinose or structurally related substances; the 
reduction of tryptophane concentration; and the adjustment 
of transformant cultivation temperature. 

Induction of an increase in the copy number of 
5 plasmid vector and that of expression of RNA polymerase 
may be achieved by the action of respective exogenous 
factors or a same exogenous factor. Also, in this case, 
the gene encoding RNA polymerase may be incorporated onto 
a chromosome of the host, or introduced into the host by a 
10 plasmid vector other than the plasmid vector of the 

present invention or a phage vector. In the latter case, 
the host can readily be changed according to the purpose. 
In that case, introduction of the vector having an RNA 
polymerase gene into the host may precede or follow the 
15 introduction of the plasmid vector of the present 
invention . 

Although the desired gene in the present invention is 
not subject to limitation, it is meaningful when it 
encodes a protein lethal or harmful to the host. Such 
proteins include, for example, the above-mentioned 
restriction endonucleases, other nucleases, nucleic 
acid-binding proteins, and proteases. 

The present invention provides a system that controls 
the expression of a desired gene by a combination of two 
control methods, i.e., control of the copy number of the 
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gene, and control of transcription via a promoter, as 
described above. Control of the expression of the desired 
gene cloned onto a plasmid in the present invention, 
unlike conventional control, is very stringent so that its 
expression can be suppressed to an undetectable level in a 
non-inductive condition. When the copy number of the 
plasmid, i.e., the copy number of the desired gene, is 
increased, with concurrent expression of RNA polymerase, 
by expression induction, the desired gene is transcribed 
and translated under control of the promoter by the action 
of the RNA polymerase. 

It is therefore possible to use the plasmid vector of 
the present invention to prepare a transformant containing 
a gene harmful or lethal to the host and express the gene, 
a task difficult to achieve by the prior art. 

For example, when cloning of a gene encoding the 
Nsp7524 III restriction endonuclease was attempted using 
the plasmid vector of the present invention, the gene was 
successfully retained in Escherichia coll without 
co-presence of the corresponding modification enzyme gene. 
It was also possible to express the Nsp7524 III 
restriction endonuclease in the cells of the resulting 
transformant by introducing a system plasmid containing an 
RNA polymerase gene into the transformant to induce the 
expression of the Nsp7524 III restriction endonuclease. 
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Also, using the plasmid vector of the present 
invention as a cloning vector for preparation of a gene 
library, it is possible to isolate a gene that cannot be 
isolated by a conventional method, and confirm the 
activity of its expression product in a single host. The 
use of the plasmid vector of the present invention is of 
course not limited to the cloning of a gene encoding a 
product harmful to the host, and can be used as a 
general-purpose plasmid vector. 

An RNA polymerase gene relating to the present 
invention can be introduced into a host using, for 
example, the plasmid P FSP6 and the phage Ml3sp6. Details 
of construction of the plasmid pFSP6 are shown in 
Reference Example (2). Escherichia coll HB101 as 
transformed with the plasmid, designated Escherichia coli 
HBl01/pFSP6, has been deposited under accession number 
FERM BP-5742 at the National Institute of Bioscience and 
Human-Technology, Agency of Industrial Science and 
Technology, Ministry of International Trade and Industry 
[address: 1-3, Higashi 1 chome, Tsukuba-shi, Ibaraki-ken, 
305, Japan] since December 22, 1995 (date of original 
deposition). A method for the construction of the phage 
M13sp6 is described in Example 2 (6). 

The present invention is hereinafter described in 
more detail with reference to a case using the plasmid 



pFSP6. 

First described are the results of an experiment 
conducted using the constructed multicopy model expression 
plasmids pMSP6L, pMSP6F (Figure 1) and pMSP60 (Figure 2), 
all having a promoter sequence recognized by SP6 RNA 
polymerase and the p-galactosidase gene as a reporter 
gene, and all allowing the expression of the desired gene 
using the above-described system plasmid pFSP6 . 

Details of construction of these model expression 
plasmids are shown in Reference Example (3). These 
plasmids incorporate the minimum region of the SP6 
promoter (pMSP6L), a inherent SP6 promoter region 
(pMSP6F), or both the minimum region of the SP6 promoter 
and the lac operator region (pMSP60), respectively, as an 
SP6 plasmid sequence. 

When the plasmid pMSP6L or pMSP6F alone was 
introduced into the Escherichia coli strains shown in 
Table 1, p-galactosidase activity was noted, despite the 
fact that the host Escherichia coli did not express SP6 
RNA polymerase, the amounts being higher than that 
obtained with the plasmid pMS434, which does not have the 
SP6 promotor, as shown in Table 2. In short, it can be 
conjectured that the SP6 promoter is recognized by the RNA 
polymerase derived from Escherichia coli, and that the 
p-galactosidase gene downstream thereof is transcribed and 
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translated. Also, when Escherichia coll MRi80 , a strain 
that has a mutation in the pcnB gene and wherein the copy 
number of a plasmid having a replication origin derived 
from ColEl, like the above-described model of expression 
5 plasmids, is reduced by 1/6 to 1/13 of that of the parent 
strain MRi7, was used as a host, the (3-galactosidase 
activity was also decreased to 1/7 to 1/14 of that of 
MRi7. This suggests a correlation between the 
(3-galactosidase gene expression level and the copy number 
10 of the plasmid introduced. 

Table 1 
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Strain 


Genotype 


Origin / Reference 


MC4100 


F- araD139, A (argF-lac)U169, 
thiA, rpsL150, relA1, flbB5301, 
deoC1, ptsF25, rbsR 


Casadaban et a I. 

J. Mol. Biol. 104(1976) 

541-555. 


MR i 7 


MC4100Arbs-7 


Lopi lato et al. 

J. Bacteriol. 158(1984) 

665-673 


MR i 80 


MR i 7 pcnB80 


Lopi lato et al. Mol. 
Gen. Genet. 205(1986) 
285-290. 
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Table 2 





(3 — Galactosidase Activity 


Strain 


Plasmid 




PMSP6F 


PMSP6L 


PMS434 


MC4100 


202 


242 


30 


MR i 7 


234 


287 


23 


MR i 80 


17 


27 


4 
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Next, when the same model of expression plasmids were 
introduced into Escherichia coli together with the 
above-described system plasmid pFSP6 and p-galactosidase 
activity was determined under conditions such that the 
expression of the SP6 RNA polymerase gene was not induced, 
the p-galactosidase activity was at most as high as that 
obtained with Escherichia coli not containing the system 
plasmid, as shown in Table 3. This demonstrates that the 
expression of the SP6 RNA polymerase gene on the plasmid 
pFSP6 is almost completely suppressed in a non-inductive 
condition. On the other hand, when the expression of the 
SP6 RNA polymerase gene was induced by the addition of 
isopropyl-p-D-thiogalactoside (IPTG), p-galactosidase 
activity was increased by 18 to 32 times, demonstrating 
that the p-galactosidase gene can be expressed via the SP6 
promoter. When the host used was Escherichia coli MRi80, 
in particular, p-galactosidase activity was increased to a 
level 25 to 32 times that obtained in a non-inductive 
condition, 4 hours after induction, though the activity in 
a non-inductive condition remained at a very low level. 



Table 3 



Strain 


B - Galactosidase Activity (Non-Induction/ Induct ion) 


PMSP6F 


PMSP6L 


P MS43 


4 


Non- 
Induction 


Induc- 
tion 


Non- 
Induction 


Induc- 
tion 


Non- 
I nduct i on 


Induc- 
tion 


MC4100(pFSP6) 


221 


4875 


234 


5256 


25 


19 


MRi7(pFSP6) 


220 


3958 


242 


4243 


25 


21 


MRi80(pFSP6) 


17 


425 


30 


970 


3 


ND 
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in the plasmid P MSP60, which carries the lac operator 
sequence downstream of the SP6 promoter, the 
p-galactosidase activity under non-inductive conditions is 
lower than that in other model expression plasmids; 
however, its expression cannot be suppressed completely. 
These results suggest that the expression of the desired 
gene is difficult to completely suppress in an expression 
system using a multicopy plasmid, and that controlling the 
copy number of the plasmid containing the desired gene is 
more effective in resolving this problem than controlling 
the expression of the RNA polymerase gene on the system 
plasmid. 

The present inventors thus made further investigation 
hased on these results to explore a new expression vector, 
ana developed the plasmid vector of the present invention. 
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Specifically, the present inventors constructed a 
runaway plasmid vector capable of reducing the expression 
level in a non-inductive condition to ensure sufficient 
expression after induction. First, in consideration of 
the fact that SP6 RNA polymerase expression in a system 
plasmid is induced by IPTG, a plasmid having a runaway 
replication origin allowing an increase in copy number by 
IPTG induction was constructed in accordance with the 
procedures shown in Figure 3. First, the plasmid 
P UC106AdPO is constructed by removing the PvuII fragment 
containing the lac promoter and the operator region from 
the plasmid pUC106A, which is obtained by introducing the 
106 Ndel DNA fragment prepared from the two DNA strands 
whose base sequences are shown by SEQ ID NO: 3 and SEQ ID 
NO: 4 in the Sequence Listing into the Ndel site of the 
plasmid vector pUC19 (produced by Takara Shuzo). 

Next, the plasmid pHS2870 can be obtained by 
inserting the RNA1870 fragment obtained by PCR using the 
RNAIIA primer (base sequence shown by SEQ ID NO: 5 in the 
Sequence Listing) arranged near the RNA I I region 
(replication origin of the plasmid) and the 1870 primer 
(base sequence shown by SEQ ID NO: 6 in the Sequence 
Listing) arranged at a terminus of the 106 Ndel sequence 
with the plasmid as a template, into the Xbal site of the 
plasmid pSTV28 (produced by Takara Shuzo). The plasmid 
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PHS2870 thus constructed is shown in Figure 4. Generally, 
the plasma is present at a copy number of about 30 copies 
per Escherichia coli cell; however, this copy number is 
increased to several hundreds by the addition of IPTG. 

The Accl-Nspl fragment containing the P15A 
replication origin derived from the plasmid P STV28 is then 
removed from the plasmid (plasmid pCRSOl), followed by 
further removal of the EcoRI-xbal fragment containing an 
unnecessary restriction endonuclease site (plasmid 
PCRS02), after which a DNA fragment containing the lactose 
repressor (laclq) gene derived from the plasmid pM0R1560 
[Gene, Vol. 51, pp. 225-267 (1987)] is introduced to yield 
the plasmid pCRS04. A flow diagram of the construction of 
the plasmid P CRS04 from the plasmid P HS2870 is shown in 
Figure 5. The plasmid pCRS04 thus constructed is shown in 
Figure 6. The plasmid is a runaway plasmid induced by 
IP TG; generally, the plasmid is present at a copy number 
of 1 to 2 copies per Escherichia coli cell; however, this 
copy number is increased to several hundreds by the 
addition of IPTG . 

to control the expression of the desired gene 
inserted into the plasmid, the P sp6 - 0l „ EX linker, a 
double- stranded oligonucleotide containing the SP6 
promoter sequence and lac operator sequence, may be 
introduced into the Nhel site of the plasmid to construct 
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the plasmid PACE601. The P^-O^ EX linger is prepared 
from the two DNA strands whose base sequences are shown by 
SEQ ID NO: 7 and SEQ ID NO: 8 in the Sequence Listing. The 
plasmid PACE601 is a runaway plasmid vector having the 
chloramphenicol resistance gene as a selection marker. 

Also, by introducing the BspHI fragment containing 
the p-lactamase gene derived from the plasmid vector 
PUC118 (produced by Takara Shuzo) into the Nhel site of 
the above-described plasmid pCRS04 (plasmid pCRSVO), 
subsequently removing the NcoI-BsaAI fragment containing 
the chloramphenicol resistance gene, and replacing it wrth 
the above-described P„- 0l _ EX linker, the plasmids P ACE701 
and PACE702 can be constructed, which plasmids have the 
linker inserted in mutually opposite directions. These 
plasmids are runaway plasmid vectors having the ampicillin 
resistance gene as a selection marker. 

The potential of the thus-obtained plasmids P ACE601, 
PACE701 and PACE702 used for control of the expression of 
the desired gene can be determined using the 
p-galactosidase gene as a reporter gene, as described 
above. It is possible to construct model expression 
plasmids by introducing a DNA fragment containing the 
p-qalactosidase gene as amplified by PCR using the primers 
tr P A-N-NcoI and lacZ-C-Ncol with the plasmid PMS434 [Gene, 
Vol. 57, PP. 89-99 (1987)] as a template, into the Ncol 



site downstream of the SP6 promoter in each of the 
above-described three plasmids . The base sequences of the 
primers trpA-N-Ncol and lacZ-C-Ncol are shown by SEQ ID 
NO: 9 and SEQ ID NO: 10 in the Sequence Listing, 
respectively. The plasmids thus obtained are designated 
PACE601Z, pACE701Z and pACE702Z, respectively. In 
Escherichia coli transformed with these model expression 
plasmids, absolutely no p-galactosidase activity is 
detected, demonstrating very exact control of the 
expression of the p-galactosidase gene. 

Using the Nsp7524 III restriction endonuclease gene, 
it is possible to confirm that a gene whose expression 
product acts lethally on the host can be isolated and 
expressed using the plasmid of the present invention. The 
plasmid pBRN3 contains the Nsp7524 III restriction 
modification system gene. Escherichia coli MC1061 as 
transformed with the plasmid, designated Escherichia coli 
MC1061/pBRN3, has been deposited under accession number 
FERM BP-5741 at the National Institute of Bioscience and 
Human-Technology, Agency of Industrial Science and 
Technology, Ministry of International Trade and Industry 
[address: 1-3, Higashi 1 chome, Tsukuba-shi, Ibaraki-ken, 
305, Japan] since September 28, 1995 (date of original 
deposition). By a PCR-based DNA amplification reaction 
using a pair of the primers L-ORF and NspR-0RF3 with the 
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plasmid pBRN3 prepared from the transformant as a 
template, a DNA fragment containing the Nsp7524 III 
restriction endonuclease gene alone can be obtained. The 
base sequences of the primers L-ORF and Ns P R-0RF3 are 
shown by SEQ ID NO: 11 and SEQ ID NO: 12 in the Sequence 
Listing, respectively. By introducing into Escherichia 
coll HB101 the plasmid P ACE601-NspIII , which is obtained 
by inserting the resulting DNA fragment into the Ncol site 
downstream of the SP6 promoter of the above-described 
plasmid pACE601, the transformant Escherichia coll 
HB101/ P ACE601-Nsplll can be obtained. The transformant 
allows the host to stably retain the Nsp7524 III 
restriction endonuclease gene despite the absence of the 
Nsp7524 III modification enzyme gene. 

Furthermore, the transformant Escherichia coll 
HBl01/pFSP6/pACE601-NspIII can be prepared by introducing 
the above-described system plasmid pFSP6 into the 
transformant. By culturing the transformant and inducing 
expression by the addition of IPTG at the appropriate 
time, the possibility of production of the Nsp7524 III 
restriction endonuclease in the culture can be confirmed. 

The third aspect of the present invention provides a 
method for isolating a desired gene characterized by using 
the above-described plasmid vector. The isolation method 
25 of the present invention is suitably applied to the 
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isolation of a gene whose expression product is lethal or 
harmful against the host, especially a restriction 
endonuclease gene. By using the present method, it is 
possible to isolate a restriction endonuclease gene that 
5 has not been isolated so far, e.g., the AccIII restriction 
endonuclease gene, even in the absence of the AccIII 
modification enzyme gene. Isolation of a restriction 
endonuclease gene can be achieved not only by the "shotgun 
method", wherein the genomic DNA of a microorganism that 
10 produces the desired enzyme is cleaved with the 

appropriate restriction endonuclease, and the resulting 
DNA fragment is inserted directly into, for example, the 
plasmid pACE611, but also by the use of a cassette library 
of the genomic DNA of a microorganism that produces the 
15 restriction endonuclease. By obtaining a gene by DNA 

amplification with the cassette library as a template, it 
is possible to express a gene prepared from a 
microorganism whose transcription mechanism differs from 
that of the host. The method of isolating a restriction 
endonuclease gene using a cassette library is hereinafter 
described with reference to an example involving the 
AccIII restriction endonuclease. 

A cassette library of the genomic DNA of a 
microorganism that produces the AccIII restriction 
endonuclease, i.e., an Acc bacterium, can be prepared as 
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follows: Genomic DNA is extracted from the cell culture 
of an Acc bacterium, and digested with the appropriate 
restriction endonuclease, after which the resulting DNA 
fragment is ligated to a cassette having a protruding end 
complementary to the fragment. Several similar cassette 
libraries are prepared using different restriction 
endonucleases for genomic DNA cleavage. These libraries 
are generically referred to as an Acc genomic cassette 
library. Next, the desired restriction endonuclease 
protein is purified from the Acc bacterium, its amino acid 
sequence is partially determined, and a primer is 
synthesized on the basis of the sequence. Using this 
primer and cassette primer, a PCR-based DNA amplification 
reaction is carried out with each cassette library as a 
template, to obtain a DNA fragment containing an AccIII 
restriction endonuclease gene fragment. The base sequence 
of the DNA fragment obtained is determined by, for 
example, direct sequencing of the PCR product, to 
determine the full-length base sequence of the AccIII 
restriction endonuclease gene. 

By designing a primer capable of amplifying the 
full-length sequence of the AccIII restriction 
endonuclease gene from the base sequence, and carrying out 
a PCR-based DNA amplification reaction using this primer 
with the genomic DNA of the Acc bacterium as a template, a 
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DNA fragment containing the full-length sequence of the 
AccIII restriction endonuclease gene is obtained. The DNA 
fragment obtained is inserted downstream of the SP6 
promoter of the plasmid pACESll so that the codon frames 
are adjusted, and the resulting recombinant plasmid is 
introduced into, for example. Escherichia coli 3M109, to 
yield transformants. Transf ormants containing the AccIII 
restriction endonuclease gene in an expressible condition 
can be selected by culturing each transf ormant, inducing 
gene expression by the gene expression method of the 
present invention, and determining the AooIII restriction 
endonuclease activity in each culture obtained, as well as 
by drawing the restriction endonuclease map of the plasmid 
DNA harbored by each transf ormant. 

The AccIII restriction endonuclease gene thus 
obtained was actually inserted into the plasmid pACESll, 
and the resulting plasmid was designated pCRA19. The 
restriction endonuclease map of this plasmid is shown in 
Figure 7, wherein the bold solid line indicates the DNA 
fragment containing the AccIII restriction endonuclease 
gene. Escherichia coli aM109 as incorporating the plasmid 
pCRA19. designated Escherichia coli JM109/PCRA19 , has been 
deposited under accession number PERM BP-5743 at the 
National Institute of Bioscience and Human-Technology. 
Agency of Industrial Science and Technology, Ministry of 
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international Trade and Industry [address: 1-3, Higashi 1 
chome, TsuXuba-shi, Ibaraki-ken, 305, Japan] since May 28, 
19 96 (date of original deposition). The transformant 
stably retains the restriction endonuclease gene, despite 
5 the absence of the AccIII modification enzyme gene 

therein. By the above-described method, a restriction 
endonuclease gene can be isolated without the co-presence 
of a modification enzyme gene that constitutes a 
restriction modification system gene. 
L0 The fourth aspect of the present invention provides a 

polypeptide possessing an activity of the AccIII 
restriction endonuclease. 

in the present specification, the term "AccIII 
restriction endonuclease" as a general term of 
15 polypeptides possessing an activity of the AccIII 

restriction endonuclease may be used in some cases. The 
AccIII restriction endonuclease of the present invention 
comprises the amino acid sequence described in SEQ ID NO=l 
in the Sequence Listing, for example. 

The AccIII restriction endonuclease of the present 
invention also includes, but not limited to the above, the 
polypeptide containing the entire or a portion of the 
amino acid sequence described in SEQ ID N0:1 in the 
Sequence Listing and possessing an activity of the AccIII 
restriction endonuclease. Furthermore, the polypeptide 
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having an amino acid sequence resulting from at least one 
of deletion, addition, insertion or substitution of one or 
more amino acid residues in the amino acid sequence of SEQ 
ID N0:1 or a portion thereof in the Sequence Listing and 
possessing an activity of the AccIII restriction 
endonuclease is included in the scope of the present 
invention. 

Generally, a naturally-occurring protein can undergo 
deletion, insertion, addition, substitution and other 
variations of amino acid residues in its amino acid 
sequence due to modifications, etc. of the protein in vivo 
or during purification, as well as those due to 
polymorphism and variation of the gene encoding it. 
Nevertheless, it is known that there are some such 
polypeptides which are substantially equivalent to 
variation-free proteins in terms of physiological and 
biological activity. Thus, those structurally different 
from the corresponding protein, but having no significant 
difference of function or activity from the protein is 
within the scope of the present invention. It is also the 
same when artificially introducing the above variations to 
the amino acid sequence of a protein and in this case, it 
is possible to produce more diverse variants. For 
example, the methionine residue at the N-terminus of a 
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protein expressed in Escherichia coll is reportedly often 
removed by the action of methionine aminopeptidase , but 
the removal is not completely done depending on the kinds 
of proteins, and some such expressed proteins have the 
5 methionine residue and others not. However, the presence 
or absence of the methionine residue does not affect 
protein activity in most cases. It is also known that a 
polypeptide resulting from replacement of a particular 
cysteine residue with serine in the amino acid sequence of 
10 human interleukin 2 (IL-2) retains IL-2 activity [Science, 
224, 1431 (1984)]. 

In addition, in producing a protein by gene 
engineering, the desired protein is often expressed as a 
fused protein. For example, the N-terminal peptide chain 
15 derived from another protein is added to the N-terminus of 
the desired protein to enhance the expression of the 
desired protein, or purification of the desired protein is 
facilitated by adding an appropriate peptide chain to the 
N- or C-terminus of the desired protein, expressing the 
20 protein, and using a carrier showing affinity for the 

peptide chain added. Thus, even if the polypeptide has an 
amino acid sequence partially different from the AccIII 
restriction endonuclease of the present invention, it is 
within the scope of the present invention as long as it 
25 possesses essentially equivalent activity to the AccIII 



- 43 - 



restriction endonuclease of the present invention. 

The AccIII restriction endonuclease can be obtained 
by, for example, culturing the above-described 
transformant Escherichia coll JM109/ P CRA19 , which contains 
the AccIII restriction endonuclease gene, adding the 
inducing agent IPTG at an appropriate time during its 
cultivation to increase the copy number of P CRA19, and 
subsequently infecting with the phage Mi3sp6. 

The AccIII restriction endonuclease can be harvested 
from the transformant culture by, for example, collecting 
cells from the culture, subsequently extracting the enzyme 
by ultrasonic disruption, ultracentrif ugation, etc., and 
then purifying the enzyme by a combination of nucleic acid 
removal, salting-out, affinity chromatography, gel 
filtration, ion exchange chromatography, etc. Because the 
above-described culture, which serves as the starting 
material for this purification, does not contain other 
restriction endonucleases, such as AccI and AccII, an 
enzyme preparation of desired purity can be obtained more 
easily than by conventional purification methods. 

The fifth aspect of the present invention provides a 
DNA encoding a polypeptide possessing an activity of the 
AccIII restriction endonuclease. 

The DNA of the present invention encoding a 
25 polypeptide possessing an activity of the AccIII 
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restriction endonuclease comprises a DNA encoding the 
amino acid sequence described in SEQ ID N0:1 in the 
Sequence Listing, and includes but not limited to, a DNA 
comprising the base sequence described in SEQ ID NO: 2 in 
5 the Sequence Listing, for example. Specifically, the 
following DNAs are within the scope of the present 
invention . 

( ! ) 3 DNA encoding a polypeptide which contains the 
entire or a portion of the amino acid sequence described 
10 in SEQ ID N0:l in the Sequence Listing and possesses an 
activity of the AccIII restriction endonuclease; 

(2) a DNA containing the entire or a portion of the DNA 
shown in SEQ ID NO: 2 in the Sequence Listing, wherein the 
expression product of the DNA possesses an activity of the 

15 AccIII restriction endonuclease; 

(3) a DNA encoding a polypeptide resulting from at least 
one of deletion, addition, insertion or substitution of 
one or more amino acid residues in the amino acid sequence 
of SEQ ID NO:l in the Sequence Listing or a portion 

20 thereof and possessing an activity of the AccIII 
restriction endonuclease; and 

(4) a DNA capable of hybridizing to the DNA described in 
above (1) to (3), and encoding a polypeptide possessing an 
activity of the AccIII restriction endonuclease, etc.. 
25 in addition, if the hybridization is carried out 



under the stringent condition using the above obtained DNA 
as a probe, a similar DNA somewhat different from the 
obtained DNA (SEQ ID NO: 2 in the Sequence Listing) but 
encoding a polypeptide possessing the same enzyme 
activity, can be obtained. Such a DNA is also included in 
the scope of the present invention. 

Such a stringent condition refers to that the 
membrane with DNA immobilized thereon is subjected to 
hybridization with the probe in a solution containing 6 x 
SSC (1 x SSC is a solution of 8.76 g of NaCl and 4.41 g of 
sodium citrate in 1 liter of water), 1% SDS, 100 pg/ml 
salmon sperm DNA, 0.1% bovine serum albumin, 0.1% 
polyvinylpyrrolidone and 0.1% Ficoll, incubating at 65 °C 
for 20 hours, for example. 

Methods for obtaining similar DNA encoding the AccIII 
restriction endonuclease by hybridization include, for 
example, the following method. 

First, DNA obtained from an appropriate gene source 
is ligated to a plasmid or a phage vector by a 
conventional method to yield a DNA library. This library 
is introduced into an appropriate host; the resulting 
transformants are cultured on plates; colonies or plaques 
that have grown are transferred onto nitrocellulose or 
nylon membranes and denatured, after which the DNA is 
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fixed onto the membrane. These membranes are incubated 
for hybridization in a solution of the above-described 
composition containing a probe previously labeled with 32 P 
etc. (probe used may be any polynucleotide encoding the 
entire or a portion of the amino acid sequence shown by 
SEQ ID N0:1 in the Sequence Listing, exemplified by a 
polynucleotide consisting of, or containing, the entire or 
a portion of the base sequence shown by SEQ ID NO: 2 in the 
Sequence Listing) under the conditions shown above. After 
completion of the hybridization, the non-specif ically 
adsorbed probe is washed out, followed by autoradiography 
etc., to identify clones that have hybridized to the 
probe. This procedure is repeated until the desired 
hybridizing clone is isolated. The clone thus obtained 
retains DNA encoding a polypeptide having the desired 
enzyme activity. 

The DNA obtained is determined for base sequence to 
confirm that it encodes the desired enzyme protein, by, 
for example, the method described below. 

For base sequencing, the transformant is cultured in 
a test tube etc. when the host is Escherichia coll, and a 
plasmid is prepared by a conventional method, provided 
that the transformant has been prepared using a plasmid 
vector. Using the plasmid obtained as a template as is, 
or after the insert is taken out and subcloned into the 
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M13 phage vector etc., the base sequence is determined by 
the dideoxy method. In the case of a transformant 
prepared using a phage vector as well, the base sequence 
can be determined by basically the same procedures. 

These basic experimental processes from cultivation 
to base sequencing are described in, for example, 
Molecular Cloning: A Laboratory Manual, 1982, T. Maniatis 
et al., published by Cold Spring Harbor Laboratory. 

Whether or not the DNA obtained is similar DNA 
encoding the desired AccIII restriction endonuclease can 
be confirmed by comparing the determined base sequence 
with the base sequence shown by SEQ ID NO: 2 in the 
Sequence Listing, or by comparing the amino acid sequence 
deduced from the determined base sequence with the amino 
acid sequence shown by SEQ ID N0:1 in the Sequence 
Listing. 

When the DNA obtained does not contain the entire 
portion of the region encoding the desired restriction 
endonuclease, the entire encoding region can be obtained 
by synthesizing a primer on the basis of the base sequence 
of the DNA obtained, and amplifying the lacking region by 
PCR using the primer, or repeating screening the DNA 
library using the DNA fragment obtained as a probe. 

It is possible to prepare a transformant containing 
the thus-obtained similar DNA encoding the AccIII 
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restriction endonuclease, to allow the transformant to 
express the enzyme protein encoded by the DNA, and to 
purify the enzyme protein expressed. Preparation of the 
transformant and expression and purification of the enzyme 
protein can be all achieved using the plasmid of the 
present invention. The enzyme protein thus obtained 
retains AccIII restriction endonuclease activity. 

Additionally, the AccIII modification enzyme and DNA 
encoding the enzyme, both of which have not been obtained 
so far, can also be obtained using the above-described DNA 
encoding the AccIII restriction endonuclease. 

For example, on the basis of the mutually close 
location of a restriction endonuclease gene and a 
modification enzyme gene in many cases, this purpose can 
be accomplished by obtaining a gene region encoding a 
protein near the restriction endonuclease gene by a DNA 
amplification reaction with a cassette library as a 
template, inserting it into an appropriate expression 
vector to allow the gene to be expressed, and confirming 
20 AccIII modification enzyme activity by an appropriate 

method. AccIII modification enzyme activity can also be 
confirmed, for example, on the basis of the resistance of 
the DNA prepared from the transformant to the cleavage 
activity of the AccIII restriction endonuclease. Provided 
25 that the base sequence of the above-described gene region 



15 
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is determined to confirm homology to a conserved region 
between modification enzyme genes of a known restriction 
modification system [Journal of Molecular Biology, Vol. 
206, pp. 305-321 (1989)], it can be anticipated to some 
5 extent before gene expression that the gene is a 
modification enzyme gene. 

By using a cassette library of the genomic DNA of the 
Acc bacterium as described above, the AccIII modification 
enzyme and DNA encoding it can be obtained. Its amino 

10 acid sequence and base sequence are shown by SEQ ID NO: 13 
and SEQ ID NO: 14 in the Sequence Listing, respectively. 

AccIII modification enzyme herein is not limited to 
the above described. As stated in the description of 
AccIII restriction endonuclease, it is a polypeptide 

15 containing the entire or a portion of the amino acid 

sequence described in SEQ ID No: 13 in the Sequence Listing 
and possessing the AccIII modification enzyme activity. 
Furthermore, the polypeptide resulting from at least one 
of deletion, addition, insertion or substitution of one or 

20 more amino acid residues in the amino acid sequence of SEQ 
ID NO: 13 or a portion thereof in the Sequence Listing and 
possessing the AccIII modification enzyme activity is also 
included in the scope of the present invention. 

A DNA encoding AccIII modification enzyme in the 

25 present invention herein comprises a DNA encoding the 
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amino acid nuance described in the SEQ ID NO: 13 in the 
Seguence Listing, and includes, nut not lifted to, a DNA 
comprising the base seguence described in the SEQ ID NO: 14 
in the Seguence Listing, for example. Specifically, the 
following DNAs are within the scope of the present 
invention. 

(1) a DNA encoding a polypeptide containing the entire or 
a portion of the amino acid seguence described in SEQ IO 
HO: 13 in the Seguence Listing and possessing the AcoIII 
modification enzyme activity; 

(2) a DNA containing the entire or a portion of the DNA 
shown in SEQ ID N0:14 in the Seguence Listing, wherein the 
expression product of the DNA possesses the AcoIII 
modification enzyme activity; 

(3 , a DNA encoding a polypeptide resulting from at least 
one of deletion, addition, insertion or substitution of 
one or more amino acid residues in the amino acid seguence 
of SEQ ID NO: 13 or a portion thereof in the Seguence 
Dfsting and possessing the AcoIII modification enzyme 
activity; and 

(4) a DNA capable of hybridizing to the DNA described in 
above (1, to (3). and encoding a polypeptide possessing 
the AccIII modification enzyme activity, etc.. 



The present 



invention is hereinafter described in 




- 51 - 

more detail by means of the following reference example 
and working examples, which examples are not to be 
construed as limitative. Of the procedures described 
herein, basic ones regarding plasmid preparation, 
5 restriction endonuclease digestion, etc. were achieved in 
accordance with the methods described in Molecular 
Cloning: A Laboratory Manual, 2nd edition, edited by T. 
Maniatis et al . , published by Cold Spring Harbor 
Laboratory, 1989. 

10 

Reference Example 

( 1 ) Culture medium and conditions 

Escherichia coli was aerobically cultured at 37 °C 
using LB medium (1% trypton, 0.5% yeast extract, 0.5% 

15 NaCl, pH 7.0). Antibiotics were each added to the medium 
at various concentrations depending on the plasmid 
retained by the Escherichia coli as follows: 25 pg/ml 
kanamycin for pFSP6, 20 pg/ml ampicillin for pXX325, 50 
pg/ml ampicillin for ampicillin-resistant ColEl type 

20 plasmid, and 100 pg/ml ampicillin for ampicillin-resistant 

pUC type plasmid. In the expression induction experiment, 
the culture broth obtained after cultivation until the 
stationary phase was inoculated to a fresh medium at 1%, 
then aerobically cultured at 37 °C, after which 

25 isopropyl-p-D-thiogalactoside (IPTG) was added to a final 




concentration of 0.2 mM upon 
(6 x 10 s cells/ml), followed 
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reach of an OD 600 value of 0.6 
by further cultivation. 



(2) Construction of the system plasmid pFSP6 
5 The system plasmid pFSP6 was constructed with 

Escherichia coli JM109 as a host, according to the 
procedure directed in Figure 8. The plasmid pSP6-2 
[Nucleic Acids Research, Vol. 15, pp. 2653-2664 (1987)] 
was digested with Hindlll (produced by Takara Shuzo) and 

10 blunted at both ends, after which it was further digested 
with BamHI (produced by Takara Shuzo) to yield an about 
2.8 kb DNA fragment containing the SP6 RNA polymerase 
gene, which fragment was mixed with the plasmid vector 
pUC18 (produced by Takara Shuzo), previously digested with 

15 BamHI and Hindi (produced by Takara Shuzo), for ligation 
to construct the plasmid pUCSP . Next, the plasmid 
PMJR1560 [Gene, Vol. 51, pp. 225-267 (1987)] was digested 
with Kpnl and blunted at both ends, after which it was 
further digested with PstI (produced by Takara Shuzo) to 

20 yield an about 1.3 kb DNA fragment containing the laclq 
gene, which was then isolated. The above plasmid pUCSP 
was digested with Hindlll and blunted at both ends, after 
which it was further digested with PstI to yield a DNA 
fragment, which was mixed with the above-described about 

25 1.3 kb DNA fragment for ligation to construct the plasmid 
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pUCSPlac. 

Furthermore, an about 1.5 kb DNA fragment obtained by 
PstI digestion of pUCKm [Journal of Molecular Biology, 
Vol. 147, pp. 217-226 (1981)], which contains the 
5 kanamycin resistance gene, was introduced into the PstI 

site of the above-described plasmid pUCSPlac to construct 
the plasmid pUCSPlacKm. The plasmid was digested with 
Aatll (produced by Toyobo ) , after which it was partially 
digested with Nspl (produced by Takara Shuzo) to isolate 
10 an about 6.5 kb DNA fragment, which was then blunted at 

both ends. 

On the other hand, the plasmid pXX325 [Proceedings of 
the National Academy of Sciences of the USA, Vol. 80, pp. 
4784-4788 (1983)] was digested with Hindlll and blunted at 

15 both ends, after which it was further digested with Sail 
to isolate an about 6.8 kb DNA fragment containing the 
replication origin of the minxF plasmid, which fragment 
was then mixed with the above-described about 6.5 kb DNA 
fragment for ligation to construct the plasmid pFSP6 . 

20 Escherichia coli HB101 was transformed with the plasmid to 

form Escherichia coli HB101/pFSP6. 



25 



(3) Construction of multicopy model expression plasmids 

The P sp6 linker, a double-stranded oligonucleotide 
containing the minimum region of the SP6 promoter, was 
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prepared from the two DNA strands whose base sequences are 
shown by SEQ ID NO: 15 and SEQ ID NO: 16 in the Sequence 
Listing, respectively, and was mixed with the plasmid 
pMS434 [Gene, Vol. 57, pp. 89-99 (1987)], previously 
5 digested with Xhol (produced by Takara Shuzo ) and Hindlll, 

for ligation to construct the model expression plasmid 
pMSP6L inserted the above-described promoter sequence 
upstream of the p-galactosidase gene on the plasmid 
(Figure 1). 

10 On the other hand, by inserting a DNA fragment 

containing the inherent SP6 promoter sequence, obtained by 
digesting pSP64 [Nucleic Acids Research, Vol. 12, pp. 
7035-7056 (1984)] with AccII and Hindlll, into the 
Xhol-Hindlll site of the above-described plasmid pMS434, 

15 the model expression plasmid pMSP6F was constructed 
(Figure 1). Furthermore, the P SP6 -0 lac linker, which 
contains the minimum region of the SP6 promoter and the 
lac operator region, was prepared from the two DNA strands 
whose base sequences are shown by SEQ ID NO: 17 and SEQ ID 

20 NO: 18 in the Sequence Listing, respectively, and was 
inserted between the Xhol -Hindi I I site of the 
above-described plasmid pMS434 to construct the model 
expression plasmid pMSP60 (Figure 2). 

25 (4) Promoter activity of the SP6 promoter for Escherichia 
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coll RNA polymerase 

The plasmids pMSP6L and pMSP6F constructed in 
Reference Example (3) and the plasmid pMS434 as a control, 
which does not contain the SP6 promoter sequence, were 
5 each introduced into the Escherichia coll shown in Table 
1. The resulting transf ormants were each cultured by the 
method described in Reference Example (1), after which 
each cell culture broth was harvested during the 
logarithmic growth phase. The OD 600 value of each culture 

10 broth collected was determined with a portion thereof, 
while the remaining portion was transferred to a 
pre-cooled test tube and supplemented with chloramphenicol 
to a final concentration of 100 mg/ml . With this culture 
broth as a sample, (3-galactosidase activity was determined 

15 by the method described in Experiments in Molecular 

Genetics, edited by J.H. Miller, published by Cold Spring 
Harbor Laboratory, 1972. 

As shown in Table 2, Escherichia coll MC4100 
incorporating the model expression plasmid pMSP6L or 

20 pMSP6F, both harboring a sequence containing the SP6 

promoter, exhibited similar level of low (3-galactosidase 
activity, irrespective of the promoter sequence inserted. 
However, that incorporating the plasmid pMS434, which does 
not contain the SP6 promoter, exhibited only lower enzyme 

25 activity. In addition, comparing Escherichia coll MRi7 
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and MRi80 as the hosts revealed decreased p-galactosidase 
activity in MR180, wherein the copy number of the plasmid 
was also decreased. 

5 ( 5 ) Evaluation of the system plasmid pFSP6 

Each transformant prepared in Reference Example (4) 
into which the system plasmid pFSP6 was further 
introduced, was cultured by the method described in 
Reference Example (1); p-galactosidase activity was 

10 determined both in a non-inductive condition and in an 

inductive condition before and after IPTG addition. The 
results were shown in Table 3. All transf ormants 
exhibited p-galactosidase activity in a non-inductive 
condition at a level similar to that obtained in the 

15 absence of the plasmid pFSP6 shown in Table 2. 

On the other hand, with induction by IPTG addition, 
the p-galactosidase activity in the transf ormants 
incorporating the plasmids pMSP6L and pMSP6F was 18 to 32 
times compared with that obtained in a non-inductive 

20 condition, while there was no increase in the activity in 

the transf ormants incorporating the plasmid pMS434. 



25 



(6) Effect of the lac operator sequence on expression 
control 

Transf ormants obtained by introducing the plasmids 
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pMSP6L, pMSP6F, pMSP60 and pMS434, respectively, into 
Escherichia coli MC4100 incorporating the plasmid pFSP6 
were each cultured by the method described in Reference 
Example (1), and (3-galactosidase activity in a 
5 non-inductive condition was determined. The results are 

shown in Table 4. The transformant incorporating the 
plasmid pMSP60, which contains the lac operator sequence , 
still showed some p-galactosidase activity, although the 
activity level was lower than those obtained with the 
10 plasmids pMSP6L and pMSP6F. In short, expression in a 

non-inductive condition could not be completely suppressed 
simply by introduction of the lac operator sequence. 

Table 4 



Strain 


0 — Galactosidase Activity (Non-inductive Condition) 


PMSP6F 


pMSP6L 


PMSP60 


PMS434 


MC4100(pFSP6) 


221 


234 


30 


25 



20 

Example 1 

(1) Culture medium and conditions 

Escherichia coli was aerobically cultured at 37 °C 
using LB medium (1% trypton, 0.5% yeast extract, 0.5% 
25 NaCl , pH 7.0). Antibiotics were each added to the medium 
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at various concentrations depending on the plasmid 
retained by the Escherichia coli as follows: 25 ug/ml 
kanamycin for pFSP6, 50 ug/ml ampicillin for 
ampicillin-resistant ColEl type plasmid, 100 pg/ml 
5 ampicillin for ampicillin-resistant pUC type plasmid, 
30 pg/ml chloramphenicol for chloramphenicol-resistant 
runaway plasmid and 30 pg/ml ampicillin for ampicillin- 
resistant runaway plasmid. In the expression induction 
experiment, the culture broth obtained after cultivation 
10 until the stationary phase was inoculated to a fresh 

medium at 1%, then aerobically cultured at 37°C, after 
which isopropyl-p-D-thiogalactoside (IPTG) was added to a 
final concentration of 0.2 mM upon reach of an OD 600 value 
of 0.6 (6 x 10 8 cells/ml), followed by further cultivation. 

15 

(2) Construction of the runaway plasmid pHS2870 

The runaway plasmid pHS2870, which provided a basis 
for construction of the expression plasmid, was 
constructed by the procedures shown in Figure 3. The 106 

20 Ndel DNA fragment prepared from the two DNA strands whose 
base sequences are shown by SEQ ID NO : 3 and SEQ ID NO : 4 in 
the Sequence Listing, respectively, was mixed with the 
plasmid vector pUC19, previously digested with Ndel, for 
ligation. The plasmid pUC106A thus obtained was digested 

25 with PvuII and subjected to self -ligation to yield the 
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yield the plasmid pUC106AdPO. With this plasmid 
pUC106AdP0 as a template, PCR was then conducted using the 
RNAIIA primer (whose base sequence shown by SEQ ID NO: 5 in 
the Sequence Listing) and the 1870 primer (whose base 
5 sequence shown by SEQ ID NO : 6 in the Sequence Listing) to 
yield an amplified DNA fragment, which was then digested 
with Xbal and mixed with the plasmid pSTV28, previously 
digested with Xbal, for ligation to yield the plasmid 
pHS2870. The construct of the plasmid pHS2870 is shown in 
10 Figure 4. 

(3) Construction of the runaway plasmid pCRS04 

After digestion with AccI (produced by Takara Shuzo ) 
and Nspl, the above-described plasmid pHS2870 was blunted 

15 at both ends and subjected to self -ligation to yield the 

plasmid pCRSOl, which lacks the P15A replication origin. 
The plasmid was then digested with EcoRI (produced by 
Takara Shuzo) and Xbal, after which it was blunted at both 
ends and subjected to self -ligation in the same manner as 

20 above to construct the plasmid pCRS02 . After an about 1.2 
kb DNA fragment obtained by digesting the plasmid pMJR1560 
with Kpnl (produced by Takara Shuzo) and PstI was blunted 
at both ends, it was mixed with the above-described 
plasmid pCRS02, previously digested with NspV and Vspl 

25 (both produced by Takara Shuzo) and then blunted at both 
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ends, for ligation to yield the plasmid pCRS04. The flow 
diagram of the construction of the plasmid pCRS04 is shown 
in Figure 5, and the construct of the plasmid pCRS04 shown 
in Figure 6. 

5 

(4) Construction of the runaway expression vector pACE601 
The above-described plasmid pCRS04, previously 

digested with Nhel (produced by Takara Shuzo) and 
subsequently blunted at both ends, was mixed with the 

10 p sp6 - Oiac EX linker, prepared from the two DNA strands whose 

base sequences are shown by SEQ ID NO: 7 and SEQ ID NO: 8 in 
the Sequence Listing, for ligation to construct the 
plasmid pACE601 . The flow diagram of the construction of 
the plasmid pACE601 is shown in Figure 9. The 

15 construction of these plasmids were conducted with 
Escherichia coli JM109 as a host. 

(5) Construction of the runaway expression vector pACE611 
The runaway expression vector pACE601 was digested 

20 with Xhol-Hindlll , and the P sp6 linker, consisting of the 
synthetic oligo-DNAs shown by SEQ ID NO: 15 and SEQ ID 
NO: 16 in the Sequence Listing, was inserted into that site 
to construct the runaway expression vector pACE611. The 
construct of pACE611 is shown in Figure 10. The flow 

25 diagram of the construction of the plasmid pACE611 is 
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shown in Figure 11. 

(6) Construction of the runaway expression vectors 
pACE701 and pACE702 
5 An about 1 kb DNA fragment obtained by digesting the 

plasmid vector pUC118 (produced by Takara Shuzo) with 
BspHI (produced by NEB) was blunted at both ends., after 
which it was mixed with the above-described plasmid 
pCRS04, previously digested with Nhel , and blunted at both 

10 ends, for ligation to construct the plasmid pCRS70. The 
flow diagram of the construction of the plasmid pCRS70 is 
shown in Figure 12. Next, the plasmid was digested with 
Ncol (produced by Takara Shuzo) and BsaAI (produced by 
NEB), after which it was blunted at both ends and mixed 

15 with the above-described P SP6 -0 lac EX linker for ligation to 

construct two plasmids pACE701 and pACE702, which 
incorporate the linker inserted in mutually opposite 
directions. The flow diagrams of the construction of the 
plasmids pACE701 and pACE702 are shown in Figure 13. The 

20 construction of these plasmids were conducted with 

Escherichia coll JM109 as a host. 



25 



(7) Construction of model runaway expression plasmids 

PCR was conducted using the primers trpA-N-Ncol and 
lacZ-C-Ncol with the above-described plasmid pMS434 as a 
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"template, to yield a DNA fragment containing the 
p-galactosidase gene. The base sequences of the primers 
trpA-N-Ncol and lacZ-C-Ncol are shown by SEQ ID NO: 9 and 
SEQ ID NO: 10 in the Sequence Listing, respectively. After 
5 digestion with Ncol, the fragment was mixed with each of 

the above-described plasmids pACE601, pACE701 and pACE702, 
all previously digested with Ncol, for ligation to yield 
the model runaway expression plasmids pACE601Z, pACE701Z 
and pACE702Z, all incorporating the p-galactosidase gene 
10 introduced downstream of the P SP6 -0 lac sequence. The 

construction of these plasmids were conducted with 
Escherichia coli JM109 as a host. 

(8) Evaluation of the expression levels of runaway 
15 expression plasmid in a non-inductive condition 

The transformants MC4100/pFSP6/pACE601Z , 
MC4100/pFSP6/pACE701Z and MC4100/pFSP6 /pACE702Z , which 
were obtained after introducing the above-described model 
runaway expression plasmids pACE601Z, pACE701Z and 
20 pACE702Z, respectively into Escherichia, coli MC4100 

incorporating the above-described system plasmid pFSP6 
(hereinafter referred to as MC4100/PFSP6 ) , were each 
cultured under the conditions described in Example (1); 
each culture broth collected during the logarithmic growth 
25 phase was assayed for p-galactosidase activity by the 
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method described in Reference Example (4). In all 
transformants examined, p-galactosidase activity was below 
the detection limit, demonstrating a greater 
expression-suppressing effect than that obtained with the 
5 multicopy plasmids shown in Reference Example (4). 

(9) Construction of the Nsp7524 III restriction 
endonuclease gene expression plasmid 
The plasmid pBRN3 , which contains the Nsp7524 III 

10 restriction modification system gene, was prepared from 
Escherichia coll MC1061/pBRN3 ( FERM BP-5741). PCR was 
conducted using the primers L-ORF and NspR-0RF3 with this 
plasmid as a template to yield an about 1 kb DNA fragment 
containing the Nsp7524 III restriction endonuclease gene 

15 alone. The base sequences of the primers L-ORF and 

NspR-ORF3 are shown by SEQ ID NO: 11 and SEQ ID NO: 12 in 
the Sequence Listing, respectively. After digestion with 
Ncol, the fragment was mixed with the above-described 
plasmid pACE601, previously digested with Ncol, for 

20 ligation to construct the plasmid pACE601-NspIII , which 

incorporates the Nsp7524 III restriction enzyme gene alone 
introduced downstream of the P SP6 -0 lac sequence. The plasmid 
was stably retained in Escherichia coll JM109 not 
containing the Nsp7524 III modification enzyme gene. 



25 
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(10) Construction of Nsp7524 III restriction endonuclease 
gene expression system 

The transforraants HB101/pFSP6/pACE601-NspIII and 
HB101/pFSP6/pACE601, which were obtained after introducing 
5 the above-described plasmid pACE601 -Nspl I I , which contains 
the Nsp7524 III restriction endonuclease gene, and the 
control plasmid pACE601 into Escherichia coll HB101 
incorporating the above-described system plasmid pFSP6 
(hereinafter referred to as HB101/PFSP6 ) , were each 

10 cultured in LB medium until the stationary phase; each 
culture broth collected was inoculated to two tubes of 
fresh medium at 1% and aerobically cultured at 37°C. Upon 
reach of an OD 600 value of 0.6, IPTG was added to one of 
the tubes to a final concentration of 0.2 mM, followed by 

15 further cultivation. After completion of the cultivation, 

cells were harvested, suspended in cell disruption buffer 
A ( 20 mM Tris-HCl, pH 7 . 5 , 10 mM 2-mercaptoethanol ) , and 
disrupted by ultrasonication, followed by centrif ugation 
to yield a crude extract. 

20 A 1 ul portion of this crude extract was added to 30 

pi of a reaction mixture (10 mM Tris-HCl, pH 7 . 5 , 10 mM 
MgCl 2 , 1 mM DTT, 50 mM NaCl , 1 pg X-DNA ) and reacted at 
37 °C for 1 hour, after which the reaction mixture was 
subjected to agarose gel electrophoresis to examine for 

25 the digestion of X-DNA and confirm restriction 
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endonuclease activity. As shown in Figure 14, restriction 
endonuclease activity was observed only in 

HB101/pFSP6/pACE601-NspIII with expression induced by the 
addition of IPTG, and its A.-DNA cleavage pattern agreed 
5 with that of Aval, an isoschizomer of Nsp7524 III. 

The results of the above-described X-DNA digestion 
reaction as conducted for extended periods of 2 and 3 
hours are shown in Figure 15. In this case, because the 
crude extract used in the present experiment contained 

10 nuclease derived from the host Escherichia coli, the DNA 
fragment resulting from Nsp7524 III activity underwent 
further degradation by Escherichia coli nuclease, 
resulting in bands whose density decreased as the increase 
in reaction time, on lanes 8 through 10, with no bands 

15 detected on lane 10. On lanes 11 through 13, the Nsp7524 
III digestion fragment was not produced because of the 
absence of Nsp7524 III induction even when reaction time 
was extended, demonstrating that the X-DNA was made into 
the lower molecular weight fragments by the direct action 

20 of Escherichia coli nuclease. No restriction endonuclease 
activity was detected in the crude extract from 
HB101/pFSP6/pACEA601 . 

Example 2 

25 Isolation and expression of the AccIII restriction 
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endonuclease gene 

(1) Determination of N-terminal amino acid sequence of 

AccIII restriction endonuclease protein and synthesis 
of primer DNA corresponding to the amino acid 
5 sequence 

About 2 ku of a commercial product of the AccIII 
restriction endonuclease (produced by Takara Shuzo) was 
subjected to gel filtration using a column of Sephacryl 
S300 (produced by Pharmacia) to determine the molecular 
10 weight of the AccIII restriction endonuclease. Judging 

from the elution position where activity was detected, the 
molecular weight of the AccIII restriction endonuclease 
was proven to be about 70,000. Because most restriction 
endonuclease proteins are dimers, the AccIII restriction 
15 endonuclease protein was expected to be mobilized to a 
position for a molecular weight of about 35,000 in SDS 
polyacrylamide gel electrophoresis. 

Next, to obtain a sample for N-terminal amino acid 
sequencing of the AccIII restriction endonuclease protein, 
20 about 2 ku of a commercial product of the AccIII 

restriction endonuclease, together with a protein 
molecular weight marker, was subjected to SDS 
polyacrylamide gel electrophoresis, after which the 
protein was transferred from the gel to a PVDF membrane 
25 and stained with bromophenol blue to confirm the protein 
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position. After destaining with 10% acetic acid-50% 
methanol, the portion of the PVDF membrane where a protein 
of about 35,000 molecular weight was blotted was cut out 
and subjected to automatic Edman degradation using the 
5 protein sequencer G1000A (produced by Hewlett-Packard) to 

determine the N-terminal amino acid sequence shown by SEQ 
ID NO: 19 in the Sequence Listing. On the basis of this 
sequence, AccIII primers 1 and 2, shown by SEQ ID NO: 20 
and SEQ ID NO: 21 in the Sequence Listing, respectively, 
10 were then synthesized for use as a pair of cassette 
primers . 

(2) Preparation of genomic DNA of Acc bacterium 

In accordance with the method of Kita et al . , 

15 described in Nucleic Acids Research, Vol. 13, pp. 

8685-8694 (1985), the Acc bacterium was cultured to obtain 
wet cells. Two grams of the wet cells obtained was 
suspended in 10 ml of buffer B [25 mM Tris-HCl ( pH 8.0), 
50 mM glucose, 10 mM EDTA] , stirred in the presence of 1 

20 ml of a lysozyme solution prepared to 2 mg/ml in buffer B, 
and kept standing at 37 °C for 20 minutes. Next, 28 ml of 
buffer C [100 mM NaCl, 100 mM Tris-HCl ( pH 8.0)] was added 
to this solution, followed by stirring. One milliliter of 
a proteinase K solution prepared to 20 mg/ml in TE [10 mM 

25 Tris-HCl, 1 mM EDTA ( pH 8.0)] and 4 ml of a 10% SDS 
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solution were further added, followed by stirring, after 
which the mixed solution was kept standing at 37 °C for 1 
hour . 

To this solution, 6 ml of a 5 M aqueous solution of 
5 NaCl and 6 ml of buffer D [10% CTAB ( cetyl trimethyl 
ammonium bromide), 0.7 M NaCl] was added, followed by 
stirring, after which the mixed solution was kept standing 
at 60 °C for 20 minutes. This solution was treated with 
phenol and subsequently with chloroform, after which the 

10 water layer was separated. To the water layer, 50 ul of 
an RNaseA (produced by Sigma) solution prepared to 10 
mg/ml in TE was added, followed by stirring, after which 
the mixed solution was kept standing at 37 °C for 40 
minutes. After being kept standing, this solution was 

15 treated with phenol and subsequently with chloroform, 

after which the water layer was separated. To the water 
layer, an equal volume of cold ethanol was added; the DNA 
precipitated was recovered by winding around a glass 
capillary. The DNA was washed with 70% ethanol and 

20 dissolved in 3 ml of TE to yield about 200 ug of genomic 

DNA. 



25 



(3) Preparation of Acc genomic cassette library 

The following procedures were conducted basically in 
accordance with the method described on pages F16-F17 in 
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"Gene Engineering Guide" (1995-1996 edition), Takara 
Shuzo . 

The genomic DNA obtained in (2) was completely 
digested with EcoRI (produced by Takara Shuzo); the DNA 
5 fragment obtained was ligated with an EcoRI cassette 

(produced by Takara Shuzo), having a cohesive end 
complementary thereto, to yield an EcoRI cassette library. 
Similarly, the genomic DNA was completely digested 
separately with the restriction endonucleases Bglll, 

10 EcoT14l, EcoT22I, Hindlll, PstI, Sail and Xbal (all 

produced by Takara Shuzo). The genomic DNA fragments 
obtained were each bound to each of several cassettes 
(produced by Takara Shuzo), having a complementary 
protruding end, to yield the Bglll, EcoT14l, EcoT22l, 

15 Hindlll, PstI, Sail and Xbal cassette libraries, 

respectively. These cassette libraries are generically 
referred to as the Acc genomic cassette library. 

(4) Analysis of the AccIII restriction endonuclease gene 
20 A PCR-based DNA amplification reaction was carried 

out using a primer pair of AccIII primer 1 and cassette 
primer CI (produced by Takara Shuzo) with the EcoRI 
cassette library obtained in (3) as a template. To 
efficiently and specifically amplify the desired region, a 
25 second PCR-based DNA amplification reaction was carried 
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out using a primer pair of AccIII primer 2 and cassette 
primer C2 (produced by Takara Shuzo) with a portion of the 
reaction liquid obtained, to yield an amplified DNA 
fragment. Similarly, with the Bglll, EcoT14l, EcoT22l, 
5 Hindlll, PstI, Sail and Xbal cassette libraries as a 

template respectively, the above two-step PCR-based DNA 
amplification reaction was carried out separatedly to 
yield amplified DNA fragments. Each amplified DNA was 
analyzed by agarose gel electrophoresis; the amplified DNA 

10 fragment derived from the Xbal cassette library showed a 

particularly high amplification efficiency. 

With this in mind, a PCR-based DNA amplification 
reaction was again carried out using a primer pair of 
AccIII primer 1 and cassette primer CI with the Xbal 

15 cassette library as a template. This reaction mixture was 
subjected to agarose gel electrophoresis; an about 0.5 kb 
amplified DNA fragment was recovered from the gel. A 
second PCR-based DNA amplification reaction was carried 
out using a primer pair of AccIII primer 2 and cassette 

20 primer C2 with the DNA fragment obtained as a template. 

Base sequencing of the about 0.5 kb amplified DNA fragment 
obtained demonstrated that the fragment encodes a portion 
of the protein whose N-terminal amino acid sequence was 
determined in (1). On the other hand, the putative 

25 molecular weight of the AccIII restriction endonuclease 
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protein is about 35,000, and the gene encoding the protein 
is assumed to be about 1 kb in length. It was therefore 
expected that the base sequence of the AccIII restriction 
endonuclease gene could be determined, provided that 
5 information on the accurate base sequence of the 

5 ' -terminal region to which AccIII primers 1 and 2 
annealed, and information on the remaining about 0.5 kb 
base sequence in the 3' region, were available, in 
addition to the above-described information on the about 

10 0.5 kb base sequence. 

With this in mind, to determine the accurate base 
sequence of the 5 ' -terminal region of the AccIII 
restriction endonuclease gene region, AccIII primer 3, 
shown by SEQ ID NO: 22 in the Sequence Listing, and AccIII 

15 primer 4, shown by SEQ ID NO: 23 in the Sequence Listing, 

were synthesized. Next, a PCR-based DNA amplification 
reaction was carried out using a primer pair of AccIII 
primer 4 and cassette primer CI with the EcoRI cassette 
library obtained in (3) as a template. A PCR-based DNA 

20 amplification reaction was carried out using a primer pair 

of AccIII primer 3 and cassette primer C2 with a portion 
of this reaction mixture. Similarly, with the Bglll, 
EcoT14l, EcoT22l, Hindlll, PstI, Sail and Xbal cassette 
libraries as a template respectively, the above two-step 

25 PCR-based DNA amplification reaction was separatedly 
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carried out. Of the DNA fragments amplified, the 
shortest, i.e., the about 1 kb DNA fragment was obtained 
with the EcoT14l cassette library as a template, the base 
sequence thereof was determined. 
5 Furthermore, to determine the about 0.5 kb sequence 

on the 3 ' side of the gene region assumed to encode the 
AccIII restriction endonuclease protein, AccIII primer 5, 
shown by SEQ ID NO: 24 in the Sequence Listing, and AccIII 
primer 6, shown by SEQ ID NO: 25 in the Sequence Listing, 

10 were synthesized. A PCR-based DNA amplification reaction 
was carried out using a primer pair of AccIII primer 5 and 
cassette primer CI with the EcoRI cassette library 
obtained in (3) as a template. A PCR-based DNA 
amplification reaction was carried out using a primer pair 

15 of AccIII primer 6 and cassette primer C2 with a portion 
of this reaction mixture. Similarly, with the Bglll, 
EcoT14I, EcoT22l, Hindlll, PstI, Sail and Xbal cassette 
libraries as a template respectively, the above two-step 
PCR-based DNA amplification reaction was carried out 

20 separatedly. 

Of the DNA fragments amplified, the about 0.5 kb and 
about 0.8 kb DNA fragments each obtained with the EcoT22I 
and Hindlll cassette libraries as a template respectively 
were subjected to a base sequencing. Combining the 

25 results for the three base sequences determined, the base 
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sequence information on an about 1.6 kb DNA fragment was 
obtained. Its base sequence is shown by SEQ ID NO: 26 in 
the Sequence Listing. Furthermore, searching for an open 
reading frame ( ORF ) capable of encoding the protein 
5 demonstrated the presence of 0RF1 at base numbers 558 

through 1442, a portion thought to be 0RF2 at base numbers 
1588 through 1434, and a portion thought to be ORF3 at 
base numbers 1 through 535. Encoding the protein whose 
N-terminal amino acid sequence was determined in ( 1 ) , 0RF1 

10 was deemed to be the AccIII restriction endonuclease gene. 
The base sequence of the AccIII restriction endonuclease 
gene is shown by SEQ ID NO: 2 in the Sequence Listing, 
wherein the fourth base, as counted from the 5' terminus, 
is C. The amino acid sequence deduced from the base 

15 sequence is shown by SEQ ID N0:1 in the Sequence Listing, 

wherein the second amino acid, as counted from the N 
terminus, is Leu. The direction of translation of ORF2 is 
opposite that of ORF1 and 0RF3 . 

20 (5) Construction of the plasmid pCRA19 

Next, a plasmid for expression of the AccIII 
restriction endonuclease gene was constructed. First, to 
obtain the gene, the primer Acc-RL, shown by SEQ ID NO: 27 
in the Sequence Listing, and the primer Acc-RR, shown by 

25 SEQ ID NO: 28 in the Sequence Listing, were synthesized. A 
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restriction endonuclease Ncol recognition sequence site 
was introduced into both primers. Using the above 
described primer pair, it is possible to cut out the 
AccIII restriction endonuclease gene from an amplified DNA 
5 fragment obtained by PCR with the genomic DNA of the Acc 

bacterium as a template, using the restriction 
endonuclease Ncol, and the translation codon frames 
coincide with each other under control of the SP6 promoter 
when the gene is inserted into the Ncol site of the 

10 pACE611 vector. On the other hand, a use of this primer 

pair results in replacement of the fourth base, as counted 
from the 5' terminus of the AccIII restriction 
endonuclease gene, from C to G, and of the second amino 
acid, as counted from the N terminus of the protein 

15 encoded by the gene, from Leu to Val. Using the above 

primer pair with the genomic DNA of the Acc bacterium 
obtained in (2) as a template, a PCR-based DNA 
amplification reaction was carried out. This DNA fragment 
was completely digested with the restriction endonuclease 

20 Ncol (produced by Takara Shuzo ) , after which it was 

subjected to agarose gel electrophoresis; a DNA fragment 
of about 900 bp size was recovered. Next, this DNA 
fragment was inserted into the Ncol site of the pACE611 
vector so that it was located downstream of the SP6 

25 promoter. 
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This recombinant DNA was introduced into Escherichia 
coll JM109 to yield transf ormants . Thirty transf ormants 
were randomly selected, and each inoculated to 5 ml of an 
LB medium containing 30 ug/ml chloramphenicol and cultured 
5 at 37 °C. When the OD 600 value of the cell culture broth 

reached 0.6, IPTG was added to a final concentration of 2 
raM to increase the plasmid copy number, followed by 
further cultivation at 37 °C for 2 hours, after which 
plasmid DNA was prepared from each culture. Each plasmid 

10 DNA obtained was simultaneously cleaved with the 

restriction endonucleases Hindlll and Xbal, followed by 
confirmation of the length of the resulting DNA fragments 
by agarose gel electrophoresis; a plasmid thought to 
contain the gene region inserted in the right direction 

15 was detected. This plasmid, designated pCRA19, was 
introduced into Escherichia coll JM109 to yield a 
transf ormant , which was designated Escherichia coli 
JM109/pCRA19 . 

20 (6) Construction of expression system for the AccIII 

restriction endonuclease gene 

First, the SP6 RNA polymerase gene was introduced 
into Escherichia coli JM109 to construct a phage vector 
allowing further expression of the gene. Specifically, by 
25 inserting an SP6 RNA polymerase gene fragment obtained by 
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BamHI-Hindlll digestion of pSP6-2 into the BamHI-Hindlll 
site within the multicloning site of a commercial product 
of the phage vector M13mpl8 (produced by Takara Shuzo ) , 
the SP6 RNA polymerase expression phage M13sp6 was 
5 constructed. 

Next, the transformant Escherichia coll JM109/pCRA19 
was inoculated to 5 ml of an LB medium containing 30 ug/ml 
chloramphenicol and cultured at 37 °C. When the OD 600 value 
of the cell culture broth reached 0.6, IPTG was added to a 

10 final concentration of 2 mM to increase the plasmid copy 

number, followed by further cultivation at 37°C for 2 
hours. These cells were then infected with the phage 
M13sp6 to express the protein encoded by the AccIII 
restriction endonuclease gene, followed by further 

15 cultivation at 37 °C for 16 hours. A 11 mg portion of the 
wet cells obtained was suspended in 180 pi of cell 
disruption buffer E [20 mM Tris-HCl ( pH 7 . 5 ) , 10 mM 
2-mercaptoethanol ] , after which the cells were disrupted 
by ultrasonication, followed by centrif ugation (18000 g, 

20 10 minutes) to separate the solid and liquid. 

Determining the activity of the supernatant under the 
activity determination conditions shown on the data sheet 
attached to the AccIII restriction endonuclease produced 
by Takara Shuzo demonstrated the production of the AccIII 

25 restriction endonuclease in an amount of about 8000 units 
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per gram of wet cells, a level about 16 times that 
obtained with the Acc bacterium, per unit weight of wet 
cells. Neither activity of restriction endonucleases 
other than AccIII nor AccIII modification enzyme activity 
5 was noted in the supernatant obtained. Regarding the 

AccIII restriction endonuclease gene inserted into the 
plasmid pCRA19, it was demonstrated that the fourth base, 
as counted from the translation initiation base, was 
replaced from C to G, upon DNA amplification by PCR, 

10 resulting in the replacement of the second amino acid, as 
counted from the N-terminus of the translated protein, 
from Leu to Val, and that the protein possesses AccIII 
restriction endonuclease activity. 

An isolation/mass production system for the AccIII 

15 restriction endonuclease gene in the absence of AccIII 

modification enzyme was thus developed. 

(7) Isolation of the AccIII modification enzyme gene 
Modification enzyme genes and restriction 

20 endonuclease genes are often closely located. With this 
in mind, to determine the 0RF2 region deduced in (4) 
above, the base seguence of an about 1.4 kb DNA fragment 
obtained using the EcoRI cassette library as a template, 
out of the amplified DNA fragments prepared to determine 

25 the 3 ' region of the AccIII restriction endonuclease gene 
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in (4) above, was determined. Next, to determine the ORF3 
region deduced in (4) above, the base sequence of an about 
2.2 kb DNA fragment obtained using the Bglll cassette 
library as a template, out of the amplified DNA fragments 
5 prepared to determine the 5 1 -terminal region of the AccIII 
restriction endonuclease gene in (4) above, was 
determined. Combining these base sequences and the base 
sequence of the about 1.6 kb DNA fragment containing the 
AccIII restriction endonuclease gene region determined in 

10 (4) resulted in the information on the about 4.2 kb base 

sequence shown by SEQ ID NO: 29 in the Sequence Listing. 
The AccIII restriction endonuclease gene was located at 
base numbers 1913 through 2797, 0RF2 at base numbers 3712 
through 2789, and ORF3 at base numbers 691 through 1890. 

15 Of these ORFs, 0RF2 proved to contain a portion highly 

homologous to the conserved region among modification 
enzymes in a known restriction modification system. 

Next, to obtain the 0RF2 region, an about 1.1 kb 
ORF2-containing portion was cut out using EcoT22l and Hpal 

20 (produced by Takara Shuzo) from an amplified DNA fragment 

obtained by two-step PCR using a primer pair of Acc primer 
6 and cassette primer CI and another primer pair of Acc 
primer 5 and cassette primer C2 in the respective steps, 
with the EcoRI cassette library as a template, and was 

25 inserted into the Smal site downstream of the lac promoter 
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in the pUC118 vector. If this recombinant plasmid 
contains the AccIII modification enzyme gene, and if the 
AccIII modification enzyme can be expressed in Escherichia 
colt, the DNA in the culture of the transf ormant obtained 
5 by introducing this recombinant plasmid into Escherichia 
coll JM109 would undergo methylation by the AccIII 
modification enzyme and acquire resistance to cleavage by 
the AccIII restriction endonuclease . 

Because the pUC118 vector used to construct this 

10 recombinant plasmid has no AccIII restriction endonuclease 
recognition sequence, however, it is inappropriate to use 
this recombinant plasmid by itself to confirm AccIII 
modification enzyme activity. On the other hand, there is 
an AccIII restriction endonuclease recognition sequence in 

15 the plasmid pSTV29, which can be co-present with this 

recombinant plasmid in Escherichia coli JM109. With this 
in mind, to utilize pSTV29 as an index of expression of 
the AccIII modification enzyme, the above recombinant 
plasmid and pSTV29 were both introduced into Escherichia 

20 coli JM109 to yield transf ormants . Three transf ormants 
were each cultured at 37 °C for 16 hours in 2 ml of an LB 
medium containing 100 ug/ml ampicillin, 30 pg/ml 
chloramphenicol and 2 mM IPTG, after which plasmid DNA was 
prepared from each culture. 

25 The DNA thus prepared is available as a mixture of 
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pSTV29 and "the above-described recombinant plasmid. When 
each DNA sample was subjected to a digestion with the 
AccIII restriction endonuclease , the pSTV29 in all samples 
exhibited resistance to the AccIII restriction 
5 endonuclease activity, demonstrating the insertion of the 
AccIII modification enzyme gene into the recombinant 
plasmid contained in the DNA sample. Furthermore, when 
the DNA sample was simultaneously cleaved with the 
restriction endonucleases Hindlll and Xbal, followed by 

10 analysis of the length of the resulting DNA fragments by 

agarose gel electrophoresis, the presence of 0RF2 in the 
recombinant plasmid was confirmed. ORF2 was thus proven 
to be the AccIII modification enzyme gene. The base 
sequence of the AccIII modification enzyme gene obtained 

15 and the amino acid sequence deduced therefrom are shown by 
SEQ ID NO: 14 and SEQ ID NO: 13 in the Sequence Listing, 
respectively . 

The structure of the AccIII restriction modification 
system gene demonstrated according to the present 
20 invention is shown in Figure 16, wherein M, R and the 

arrow represent the modification enzyme gene, the 
restriction endonuclease gene, and the orientation of ORF, 
respectively. 

25 INDUSTRIAL APPLICABILITY 
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The present invention provides a plasmid vector 
capable of introducing into a host an exogenous desired 
gene encoding a protein lethal or harmful to the host, and 
a method for being capable of efficiently expressing the 
5 protein using the plasmid vector for the first time. A 
method for being capable of isolating a restriction 
endonuclease gene which constitutes a restriction 
modification system without co-existence of a modification 
enzyme gene, which has been difficult in the prior arts, 

10 is also provided. Furthermore, an AccIII restriction 

endonuclease gene and an AccIII modification enzyme gene 
are isolated by the present invention, and, from 
Escherichia coll transformed with the plasmid containing 
the gene, it is possible to easily obtain an AccIII 

15 restriction endonuclease or an AccIII modification enzyme 

available in the genetic engineering at a desired purity 
of an enzyme preparation compared to the prior method for 
purification . 



SEQUENCE LISTING 

SEQ ID NO:l 
SEQUENCE LENGTH: 29 5 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FEATURE: 2 (Val or Leu) 
SEQUENCE DESCRIPTION: 

Met Xaa Pro Leu Asp Lys Asp Leu Gin Lys Ala Lys lie Ser lie 

15 10 15 

Thr Asp Phe Phe Glu lie Thr Asn Arg Val Leu Asp Tyr Phe Pro 

20 25 30 

Asn Val lie Asn Asn Thr Val Glu Lys Gly Asp Tyr Leu lie Ser 

35 40 45 

Ser Ser Asn lie Ala Gly Thr lie Lys Phe Leu Arg Pro lie Asn 

50 55 60 

Arg Lys Leu Phe lie Gin Glu Lys Lys Val Phe Asn Asp Tyr Phe 

65 70 75 

Gin Lys Leu lie lie Val Phe Glu Asn lie Arg Asn Lys Lys Thr 

80 85 90 

Val Thr Glu Glu Asp Lys lie lie lie Asp Arg Val lie Tyr Thr 

95 100 105 

lie Gin Gin Ser lie Gly lie Gly Leu Asp Leu Met Val Asn Gin 

110 115 120 

Asn Ser Ala Arg Lys His Val Gly Asn Arg Phe Glu Glu Leu lie 

125 130 135 

Arg Val lie Phe Thr Glu lie Ser Val Ser Asn Lys Arg Thr Val 

140 145 150 

Leu Gin lie Pro Tyr Glu Thr Asp Glu Gly Gin Lys lie Tyr Lys 

155 160 165 

Cys Glu Asn Asp Leu lie lie Ser Pro Phe Glu Asn Val Glu Ser 



Thr Asn Lys His Leu Asp Glu Asn Glu lie Val Val Ser lie Lys 

185 190 195 

Thr Thr Ser Lys Asp Arg Met Gly Lys Met Phe lie Asp Lys lie 

200 205 210 

Leu Leu Glu Arg Phe Val Lys His Pro Gin Lys Val lie Gly lie 

215 220 225 

Phe Leu Asn Asp Val Gin Arg Lys Glu Asp Asn Asn lie Ser Phe 

230 235 240 

Thr Leu Val Ser Gly Leu Phe Met Val Tyr Thr Lys Phe Leu Thr 

245 250 255 

Thr Leu Glu Gly lie Tyr Tyr Leu Asp Pro Pro Pro Asn Ala Leu 

260 265 270 

Lys Leu Pro Tyr Ser Asn His Met Lys Arg Phe Ser Asp Leu lie 

275 280 285 

Thr Glu Asp Leu Glu Lys Leu Phe Ser Ser 

290 295 



SEQ ID NO: 2 

SEQUENCE LENGTH: 88 5 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

ATGSTACCAC TGGATAAAGA TTTACAAAAA GCAAAGATTT CAATTACTGA TTTTTTTGAA 60 
ATTACAAATA GAGTTTTAGA TTATTTCCCC AATGTAATCA ATAATACAGT TGAAAAAGGA 120 
GATTATTTAA TATCCTCATC AAATATTGCT GGAACAATAA AATTCCTAAG ACCAATCAAT 180 
AGAAAGTTAT TTATTCAGGA AAAAAAAGTT TTCAATGATT ATTTTCAAAA ACTGATTATA 240 
GTTTTTGAAA ATATAAGGAA CAAAAAAACT GTAACAGAGG AAGATAAAAT TATTATTGAT 300 
AGGGTAATTT ACACAATACA GCAATCTATT GGAATTGGTT TAGATTTAAT GGTTAATCAA 3 60 
AATAGTGCTA GAAAGCACGT TGGTAACCGA TTTGAAGAAT TAATTAGAGT CATTTTTACA 420 
GAAATATCAG TATCGAATAA AAGAACTGTA TTACAAATTC CATATGAAAC TGATGAAGGA 480 
CAGAAAATTT ACAAATGCGA GAATGACCTC ATTATTTCTC CTTTTGAAAA TGTAGAATCT 540 
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ACAAACAAAC ATCTAGATGA AAATGAGATT GTTGTTTCAA TAAAGACAAC ATCAAAAGAT 600 

AGGATGGGAA AAATGTTTAT AGATAAAATT TTACTTGAAA GGTTTGTTAA ACACCCTCAA 660 

AAAGTTATAG GGATTTTCCT CAATGATGTA CAAAGAAAAG AAGACAACAA TATCAGCTTT 720 

ACACTTGTTT CAGGATTATT TATGGTGTAT ACTAAATTCT TAACTACTCT TGAAGGGATC 780 

TATTATTTAG ATCCACCACC TAATGCATTG AAACTACCAT ATTCTAATCA TATGAAAAGA 840 

TTTTCAGATT TAATTACAGA AGACCTTGAA AAATTATTCT CCTCT 885 



SEQ ID NO: 3 
SEQUENCE LENGTH: 215 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION: 

TATGGATATG TTCATAAACA CGCATGTAGG CAGATAGATC TTTGGTTGTG AATCGCAACC 60 
AGTGGCCTTA TGGCAGGAGC CGCGGATCAC CTACCATCCC TAATGACCTG CAGGCATGCA 120 
AGCTTGCATG CCTGCAGGTC ATTAGGTACG GCAGGTGTGC TCGAGGCGAA GGAGTGCCTG 180 
CATGCGTTTC TCCTTGGCTT TTTTCCTCTG GGACA 215 

SEQ ID NO: 4 
SEQUENCE LENGTH: 215 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TATGTCCCAG AGGAAAAAAG CCAAGGAGAA ACGCATGCAG GCACTCCTTC GCCTCGAGCA 60 
CACCTGCCGT ACCTAATGAC CTGCAGGCAT GCAAGCTTGC ATGCCTGCAG GTCATTAGGG 120 
ATGGTAGGTG ATCCGCGGCT CCTGCCATAA GGCCACTGGT TGCGATTCAC AACCAAAGAT 180 
CTATCTGCCT ACATGCGTGT TTATGAACAT ATCCA 215 



SEQ ID NO: 5 

SEQUENCE LENGTH: 28 
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SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION: 

AGATCTAGAG CAAACAAAAA AACCACCG 28 

SEQ ID NO: 6 
SEQUENCE LENGTH: 24 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GGTCTAGATC CCAGAGGAAA AAAG 24 

SEQ ID NO: 7 
SEQUENCE LENGTH: 100 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CTCGAGATTT AGGTGACACT ATAGAATACG GAATTGTGAG CGGATAACAA TTCCAAGCTT 
CACAGGAAAC AGACCATGGC TTAAGTAACT AGTGAATTCG 

SEQ ID NO: 8 
SEQUENCE LENGTH: 100 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 
CGAATTCACT AGTTACTTAA GCCATGGTCT GTTTCCTGTG AAGCTTGGAA TTGTTATCCG 60 
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CTCACAATTC CGTATTCTAT AGTGTCACCT AAATCTCGAG 



SEQ ID NO: 9 
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION: 

AATCCCATGG AACGCTACGA ATCTCTG 27 



SEQ ID NO: 10 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CCGGCCATGG TTATTTTTGA CACCAGACC 29 



SEQ ID NO: 11 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TAACTTGAAT CCATGGGTTC TCACCG 26 



SEQ ID NO: 12 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
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MOLECULE TYPE: Other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION: 
TACTCAGTAG CCATGGCTCT CATAGACCG 



SEQ ID NO: 13 
SEQUENCE LENGTH: 308 
SEQUENCE TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Asn Glu lie Ala Phe Asp Asn Tyr Ser Tyr lie Pro Lys Leu 

15 10 15 

Lys Leu Tyr Ser Glu lie Glu Leu Lys Pro Phe Phe lie Ser Lys 

20 25 30 

Asn Gly Ser Leu Phe Asn Val Asp Ala lie Asp Phe Leu Arg Lys 

35 40 45 

Leu Glu Ser Asn Ser Val Asp Leu lie Phe Ala Asp Pro Pro Tyr 

50 55 60 

Asn lie Lys Lys Ala Glu Trp Asp lie Phe Ser Ser Gin Asn Glu 

65 70 75 

Tyr Leu Glu Trp Ser Lys Glu Trp lie Met Glu Ala His Arg Val 

80 85 90 

Leu Lys Asp Asn Gly Ser Leu Tyr Val Cys Gly Phe Ser Glu lie 

95 100 105 

Leu Ala Asp lie Lys Phe lie Thr Ser Lys Tyr Phe His Ser Cys 

110 115 120 

Lys Trp Leu lie Trp Phe Tyr Arg Asn Lys Ala Asn Leu Gly Lys 

125 130 135 

Asp Trp Gly Arg Ser His Glu Ser lie Leu Leu Leu Arg Lys Ser 

140 145 150 

Lys Asn Phe lie Phe Asn lie Asp Glu Ala Arg lie Pro Tyr Asn 

155 160 165 

Glu His Thr Val Lys Tyr Pro Gin Arg Thr Gin Ala Glu Ser Ser 



Gin Tyr Ser Asn Ser Lys Lys Gin Tyr lie Trp Glu Pro Asn Pro 

185 190 195 

Leu Gly Ala Lys Pro Lys Asp Val Leu Glu lie Pro Thr lie Ser 

200 205 210 

Asn Gly Ser Trp Glu Arg Ser lie His Pro Thr Gin Lys Pro Val 

215 220 225 

Glu Leu Leu Lys Lys lie lie Leu Ser Ser Ser Asn Lys Asp Ser 

230 235 240 

Leu lie Leu Asp Pro Phe Gly Gly Ser Gly Thr Thr Tyr Ala Val 

245 250 255 

Ala Glu Ala Phe Gly Arg Lys Trp lie Gly Thr Glu Leu Asp Lys 

260 265 270 

Asn Tyr Cys Leu Glu lie Gin Lys Arg Leu Lys Asp Glu Ser Met 

275 280 285 

lie Asn Arg lie Phe Ser Gly Asp Asp Asp Ser Asn Ser Gin Asn 

290 295 300 

Arg Arg Lys Lys Leu Arg Gly Glu 

305 



SEQ ID NO: 14 

SEQUENCE LENGTH: 924 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

GTGAATGAAA TAGCGTTTGA TAATTACAGT TATATACCAA AATTAAAACT TTATTCGGAA 60 
ATCGAGCTTA AACCATTTTT TATTTCAAAA AACGGTTCAC TTTTCAATGT TGATGCTATT 120 
GATTTTTTAA GAAAATTAGA GAGTAATTCT GTGGATTTAA TTTTTGCAGA TCCACCTTAT 180 
AACATTAAAA AGGCAGAGTG GGATATTTTT TCTTCTCAAA ATGAATATCT CGAATGGAGT 240 
AAAGAATGGA TAATGGAAGC TCATAGAGTT TTAAAAGATA ATGGCAGTTT ATATGTTTGT 300 
GGCTTTTCAG AAATTCTGGC AGACATAAAA TTTATCACTT CAAAATATTT TCACAGTTGT 360 
AAATGGTTGA TTTGGTTCTA TAGAAACAAG GCAAATTTAG GTAAAGATTG GGGACGTTCA 420 
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CACGAAAGTA TACTGTTATT AAGAAAATCT AAAAATTTTA TTTTTAATAT TGATGAGGCA 480 

CGAATCCCGT ATAATGAGCA TACAGTTAAA TATCCACAAA GAACCCAGGC CGAATCTTCG 540 

CAATATTCGA ACTCAAAAAA GCAATATATT TGGGAGCCAA ACCCATTAGG AGCTAAGCCA 600 

AAAGATGTTT TGGAGATTCC CACAATTTCA AATGGTTCTT GGGAAAGAAG TATTCACCCT 660 

ACGCAAAAGC CAGTAGAATT GCTTAAAAAA ATAATTTTAT CTTCATCTAA TAAAGATAGT 720 

TTAATTCTTG ATCCATTTGG TGGTTCGGGA ACTACATATG CTGTTGCGGA AGCTTTTGGC 780 

AG A A A AT GG A TTGGAACAGA GTTAGATAAA AATTATTGTC TGGAAATTCA AAAGCGATTG 840 

AAAGACGAAA GTATGATCAA CAGGATTTTT TCAGGCGATG ATGATTCAAA TTCTCAAAAT 900 

AGAAGAAAAA AATTAAGAGG AGAA 924 

SEQ ID NO: 15 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TCGAGATTTA GGTGACACT A TAGAAT ACA 2 9 

SEQ ID NO: 16 
SEQUENCE LENGTH: 29 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AGCTTGTATT CTATAGTGTC ACCTAAATC 29 

SEQ ID NO: 17 
SEQUENCE LENGTH: 54 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
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SEQUENCE DESCRIPTION: 

TCGAGATTTA GGTGACACTA TAGAATACGG AATTGTGAGC GGATAACAAT TCCA 



54 



SEQ ID NO: 18 
SEQUENCE LENGTH: 54 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION: 

AGCTTGGAAT TGTTATCCGC TCACAATTCC GTATTCTATA GTGTCACCTA AATC 54 

SEQ ID NO: 19 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: 

Met Leu Pro Leu Asp Lys Asp Leu Gin Lys Ala Lys lie Ser lie 
15 10 15 

Thr Asp Phe Phe Glu 
20 

SEQ ID NO: 20 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
FEATURE: 6, 9, 12 ( inosine ) 
SEQUENCE DESCRIPTION: 

ATGTTNCCNY TNGAYAARGA YYT 23 
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SEQ ID NO: 21 
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
FEATURE: 9 ( inosine ) 
SEQUENCE DESCRIPTION: 
AAGGATTTNC ARAARGCNAA RAT 



SEQ ID NO: 22 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

TAAATCTAAA CCAATTCCAA TAGATTGCTG 30 



SEQ ID NO: 23 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

CAAATCGGTT ACCAACGTGC TTTCTAGCAC 30 



SEQ ID NO: 24 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
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SEQUENCE DESCRIPTION: 
GAACTGTATT ACAAATTCCA TATGAAACTG 



30 



SEQ ID NO: 25 
SEQUENCE LENGTH: 30 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA ) 
SEQUENCE DESCRIPTION: 

GACAGAAAAT TTACAAATGC GAGAATGACC 30 

SEQ ID NO: 26 

SEQUENCE LENGTH: 1588 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

CCATGGCACA CGTTTCAAAA AAGAAAT CCT CGAAGTCAAA TATGATGAGA AAAACATCTC 60 

AGACATCCTG CATATGACGG TGGATGAAGC ATTGGAATTT TTCTCGGAAA ATCACGAAGA 120 

AAAAATTGTA ACCAAACTAA AACCTTTGCA GGACGTTGGT TTGGGTTATC TTCAGTTAGG 180 

CCAGTCCTCC TCTACTCTTT CCGGCGGTGA AGCCCAAAGA GTGAAGCTCG CCTCTTTCCT 240 

TGT GAAAGGT GTAACGACGG AAAAAACGTT ATTTGTTTTT GATGAACCAT CAACAGGATT 300 

ACATTTCCAC GACATTCAAA AATTACTGAA ATCACTTCAG GCACTGATAG AATTAGGGCA 3 60 

TTCGGTTGTA GTGATTGAGC ATCAGCCGGA TATTATCAAA TGCGCCGATT ACATCATCGA 420 

TGTCGGACCC AATGCCGGAA AATACGGTGG CGAAATTGTT TTCACAGGAA CTCCGGAAGA 480 

TTTGGTAAAA GAGAAAAAGT CGTTTACAGG GAAGTATATT AAGGAGAAGT TAAAGTAATT 540 

TATTTATATT TGAAGTTATG CTACCACTGG ATAAAGATTT ACAAAAAGCA AAGATTTCAA 600 

TTACTGATTT TTTTGAAATT ACAAATAGAG TTTTAGATTA TTTCCCCAAT GTAATCAATA 6 60 

ATACAGTTGA AAAAGGAGAT TATTTAATAT CCTCATCAAA TATTGCTGGA ACAATAAAAT 720 

TCCTAAGACC AATCAATAGA AAGTTATTTA TTCAGGAAAA AAAAGTTTTC AATGATTATT 780 

TTCAAAAACT GATTATAGTT TTTGAAAATA TAAGGAACAA AAAAACTGTA ACAGAGGAAG 840 

ATAAAATTAT TATTGATAGG GTAATTTACA CAATACAGCA ATCTATTGGA ATTGGTTTAG 900 
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ATTTAATGGT TAATCAAAAT AGTGCTAGAA AGCACGTTGG TAACCGATTT GAAGAATTAA 9 60 
TTAGAGTCAT TTTTACAGAA ATATCAGTAT CGAATAAAAG AACTGTATTA CAAATTCCAT 1020 
ATGAAACTGA TGAAGGACAG AAAATTTACA AATGCGAGAA TGACCTCATT ATTTCTCCTT 1080 
TTGAAAATGT AGAATCTACA AACAAACATC TAGATGAAAA TGAGATTGTT GTTTCAATAA 1140 
AGACAACATC AAAAGATAGG ATGGGAAAAA TGTTTATAGA TAAAATTTTA CTTGAAAGGT 1200 
TTGTTAAACA CCCTCAAAAA GTTATAGGGA TTTTCCTCAA TGATGTACAA AGAAAAGAAG 12 60 
ACAACAATAT CAGCTTTACA CTTGTTTCAG GATTATTTAT GGTGTATACT AAATTCTTAA 1320 
CTACTCTTGA AGGGATCTAT TATTTAGATC CACCACCTAA TGCATTGAAA CTACCATATT 1380 
CTAATCATAT GAAAAGATTT TCAGATTTAA TTACAGAAGA CCTTGAAAAA TTATTCTCCT 1440 
CTTAATTTTT TTCTTCTATT TTGAGAATTT GAATCATCAT CGCCTGAAAA AATCCTGTTG 1500 
ATCATACTTT CGTCTTTCAA TCGCTTTTGA ATTTCCAGAC AATAATTTTT ATCTAACTCT 1560 
GTTCCAATCC ATTTTCTGCC AAAAGCTT 1588 



SEQ ID NO: 27 
SEQUENCE LENGTH: 25 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

AT ATTTGAAG CCATGGTACC ACTGG 2 5 



SEQ ID NO: 28 
SEQUENCE LENGTH: 26 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Oyther nucleic acid (synthetic DNA) 
SEQUENCE DESCRIPTION: 

GATGATTCAA ATTCTCACCA TGGAAG 2 6 



SEQ ID NO: 29 

SEQUENCE LENGTH: 4146 

SEQUENCE TYPE: nucleic acid 




STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: Genomic DNA 
SEQUENCE DESCRIPTION: 
AGATCTGGTC ATCCCAAACA AAAATCTTTC 
AGGCGAAAGT ATGAGCGAAT GGAAAAAGGA 
TCACAAGCCT TATCATCAAC TCACAAAAGA 
AACCAGAAGT TTCCCAAGTA TTGATAATTT 
GATCCAATAC CGCGTAATGC TTTCGCGCTA 
AGG AT T ACG A TTGCGGGAAG AAACAAGCTG 
TTTGATTGAA TTACCTTTGG ATGAACTCCT 
CCACGACAGA GAAAT TGCC A AACGCCTGAC 
GACGAAAGTC GGCCTTGGAT ATCTGACTTT 
AGAAAGCCAG AGAATCAATC TGGCGACAGC 
ATTTTGGATG AGCCGAGCAT TGGTCTGCAT 
CTCAAACAAC TCCGCGATTT GGGAANTACC 
ATGCTTGCGG CAGNTTACAT TATAGATATT 
CTTGTTTTCA GCGNGGATTA TAAAGAGATG 
CTGAATGGCG AACTGAAAAT AGAAGTTCCT 
GCAATAAAAG GTGCCCGCCA GAATAATTTA 
TGTCTGACAG TTATCACAGG CGTTTCTGGA 
ATGACCAATG CCATCCAGAT CCAACTGGGA 
GTGGAATTCC CGAAAAAGCT GATCCAGAAT 
AAATCGTCCC GCTCCAACCC CGTGACTTAT 
TTTGCGAAAC AAAAATCCGC AAAAATCCAG 
GTGGATGGCG GAAGATGTGA CGAGTGCAAA 
TTTATGGCGG ACATCGAGCT GGAGTGTGAG 
ATCCTCGAAG TCAAATATGA TGAGAAAAAC 
GAAGCATTGG AATTTTTCTC GGAAAATCAC 
TTGCAGGACG TTGGTTTGGG TTATCTTCAG 
GGTGAAGCCC AAAGAGTGAA GCTCGCCTCT 
ACGTTATTTG TTTTTGATGA ACCATCAACA 
CTGAAATCAC TTCAGGCACT GATAGAATTA 
CCGGATATTA TCAAATGCGC CGATTACATC 
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Kjlj 111 Av_*ljAA 


GATGCAGTCG 


CTTCCTGGAA 


60 


Ail l^A 1 Lftftrt 


AAAbULrtrtAb 


ATTTCCCAAT 


120 




1 1 LL 111 ubn 


AAGGCGATAA 


180 


TTTCAAAATG 


LI 1 IoAAVjjAIjA 


A1L1 1 1 AL AA 


240 


TCGTGGGAAA 


ACACTTTGCC 


LLbA 1 1 LjL-LjA 




GGTGAAGATT 


GACGGACACA 


ALA 1 1 L-AI^j 1 1^ 




GCCATTGATC 


AAAAGCTTAA 


ArtL 1 Ij/inLo 1 


420 


1 1 AHj AAA 1 


GAAACGAGAT 


TAGAATTCCT 


480 


GAACCGAACA 


TCCAACACGC 


TTTCCGGAGG 


540 


TTGGGAAGTT 


CGCTGGTTGG 


TTCTATTTAT 


600 


TCCCGCGATA 


CAGAAAATCT 


GATTGGTGTC 


660 


GTGATTGTTG 


TAGAACACGA 


CGAAGATGTG 


720 


GGCCCNGNAG 


CGGGCTACCT 


TGGTGGCGAT 


780 


CTGAAGTNTN 


ATACTTTAAC 


CGCAAAATAC 


840 


GAAAAACGAA 


GAAAACCGAA 


GGAATTCATC 


900 


AAAAATATTG 


ACGTTGATGT 


TCCGTTAGAA 


960 


AGCGGGAAAT 


CCACTTTGAT 


GAAGGAAGTG 


1020 


ATGGGCGGCA 


AAAAAGCCGA 


TTACGATTCG 


1080 


ATCGAACTGA 


TTGACCAGAA 


CCCAATCGGG 


1140 


CTGAAAGCTT 


ACGACGATAT 


CCGGGATCTT 


1200 


GGTTACAAAC 


CGAAGCATTT 


CTCCTTCAAT 


1260 


GGCGAAGGTA 


TCATTACCGT 


ATCAATGCAG 


1320 


CATTGCCATG 


GCACACGTTT 


CAAAAAAGAA 


1380 


ATCTCAGACA 


TCCTGCATAT 


GACGGTGGAT 


1440 


GAAGAAAAAA 


TTGTAACCAA 


ACTAAAACCT 


1500 


TTAGGCCAGT 


CCTCCTCTAC 


TCTTTCCGGC 


1560 


TTCCTTGTGA 


AAGGTGTAAC 


GACGGAAAAA 


1620 


GGATTACATT 


TCCACGACAT 


TCAAAAATTA 


1680 


GGGCATTCGG 


TTGTAGTGAT 


TGAGCATCAG 


1740 


ATCGATGTCG 


GACCCAATGC 


CGGAAAATAC 


1800 
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GGTGGCGAAA TTGTTTTCAC AGGAACTCCG GAAGATTTGG TAAAAGAGAA AAAGTCGTTT 1860 
ACAGGGAAGT ATATTAAGGA GAAGTTAAAG TAATTTATTT ATATTTGAAG TTATGCTACC 1920 
ACTGGATAAA GATTTACAAA AAGCAAAGAT TTCAATTACT GATTTTTTTG AAATTACAAA 19 80 
TAGAGTTTTA GATTATTTCC CCAATGTAAT CAATAATACA GTTGAAAAAG GAGATTATTT 2040 
AATATCCTCA TCAAATATTG CTGGAACAAT AAAATTCCTA AGACCAATCA ATAGAAAGTT 2100 
ATTTATTCAG GAAAAAAAAG TTTTCAATGA TTATTTTCAA AAACTGATTA TAGTTTTTGA 2160 
AAATATAAGG AACAAAAAAA CTGTAACAGA GGAAGATAAA ATT ATT AT TG ATAGGGTAAT 2220 
TTACACAATA CAGCAATCTA TTGGAATTGG TTTAGATTTA ATGGTTAATC AAAATAGTGC 2280 
TAGAAAGCAC GTTGGTAACC GATTTGAAGA ATTAATTAGA GTCATTTTTA CAGAAATATC 2340 
AGTATCGAAT AAAAGAACTG TATTACAAAT TCCATATGAA ACTGATGAAG GACAGAAAAT 2400 
TTACAAATGC GAGAATGACC TCATTATTTC TCCTTTTGAA AATGTAGAAT CTACAAACAA 24 60 
ACATCTAGAT GAAAATGAGA TTGTTGTTTC AATAAAGACA ACATCAAAAG ATAGGATGGG 2520 
AAAAATGTTT ATAGATAAAA TTTTACTTGA AAGGTTTGTT AAACACCCTC AAAAAGTTAT 2580 
AGGGATTTTC CTCAATGATG TACAAAGAAA AGAAGACAAC AATATCAGCT TTACACTTGT 2640 
TTCAGGATTA TTTATGGTGT ATACTAAATT CTTAACTACT CTTGAAGGGA TCTATTATTT 2700 
AGATCCACCA CCTAATGCAT TGAAACTACC ATATTCTAAT CATATGAAAA GATTTTCAGA 2760 
TTTAATTACA GAAGACCTTG AAAAATTATT CTCCTCTTAA TTTTTTTCTT CTATTTTGAG 2820 
AATTTGAATC ATCATCGCCT GAAAAAATCC TGTTGATCAT ACTTTCGTCT TTCAATCGCT 2880 
TTTGAATTTC CAGACAATAA TTTTTATCTA ACTCTGTTCC AATCCATTTT CTGCCAAAAG 2940 
CTTCCGCAAC AGCATATGTA GTTCCCGAAC CACCAAATGG ATCAAGAATT AAACTATCTT 3000 
TATTAGATGA AGATAAAATT ATTTTTTTAA GCAATTCTAC TGGCTTTTGC GTAGGGTGAA 3060 
TACTTCTTTC CCAAGAACCA TTTGAAATTG TGGGAATCTC CAAAACATCT TTTGGCTTAG 3120 
CTCCTAATGG GTTTGGCTCC CAAATATATT GCTTTTTTGA GTTCGAATAT TGCGAAGATT 3180 
CGGCCTGGGT TCTTTGTGGA TATTTAACTG TATGCTCATT ATACGGGATT CGTGCCTCAT 3240 
C A AT AT T AAA AATAAAATTT TTAGATTTTC TTAATAACAG TATACTTTCG TGTGAACGTC 3300 
CCCAATCTTT ACCTAAATTT GCCTTGTTTC TATAGAACCA AATCAACCAT TTACAACTGT 33 60 
GAAAATATTT TGAAGTGATA AATTTTATGT CTGCCAGAAT TTCTGAAAAG CC AC A A AC AT 3420 
ATAAACTGCC ATTATCTTTT AAAACTCTAT GAGCTTCCAT TATCCATTCT TTACTCCATT 3480 
CGAGATATTC ATTTTGAGAA GAAAAAATAT CCCACTCTGC CTTTTTAATG TTATAAGGTG 3 540 
GATCTGCAAA AATTAAATCC ACAGAATTAC TCTCTAATTT TCTTAAAAAA TCAATAGCAT 3600 
CAACATTGAA AAGTGAACCG TTTTTTGAAA TAAAAAATGG TTTAAGCTCG ATTTCCGAAT 3 6 60 
AAAGTTTTAA TTTTGGTATA TAACTGTAAT TATCAAACGC TATTTCATTC ACAAATGAAT 3720 
CAATCTGCTG TTGTGTATAA ACCCTGTAAT TATTAATAGG ATGTCTTAAA CTTTTGAATT 3780 
TTCCAGAATT ATCCCATCTT CCTTAATGTC TCAGAGTTAA CATCTAATAA TTTCGCCGCT 3840 
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TCTTTTATTG ATAAATAATC ATCCATATCT TACACAACAT TACACAAGTT TATACAGCAA 3900 
ATATAAATAT TTTTTATACA TTGTAAAAAT TTTATTTACT TTTATTTTGT TCAATTGTCT 39 60 
CAATAAATAG TTAATCGAAA TACATTTTGA ATATGATAAA ATTGACTCCA ACAAATCTAA 4020 
CACAATGACA TTAAAACCAA TAAAAACGGA AGAAGATTAC AATCAGGTTT TAGAAAGACT 4080 
TTCACAAATT TTCGACGCTA AACCAAATAC CAAAGATGGA GATGAATTGG GAAATCTTGG 4140 
GAATTC 4146 

SEQ ID NO: 30 

SEQUENCE LENGTH: 23 

SEQUENCE TYPE: nucleic acid 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: Genomic DNA 

SEQUENCE DESCRIPTION: 

ATTTAGGTGA CACTATAGAA TAC 23 



